(57) Abstract

The present invention relates to novel receptor polypeptides,
which, upon interaction with certain ligands, or activation by
certain compounds, modulate transcription of certain genes by
binding to cognate response elements associated with promoters
of such genes. The novel receptors of the invention modulate
transcription in the presence of retinoid compounds. The
receptors of the present invention differ significantly from
known retinoic acid receptors, in protein primary sequence and
in responsiveness to exposure to various retinoids. The
invention provides DNAs encoding the novel receptors,
expression vectors for expression of the receptors, cells
transformed with such expression vectors, cells co-transformed
with such expression vectors and with reporter vectors to
monitor modulation of transcription by the receptors, and
methods of using such co-transformed cells in screening for
compounds which are capable, directly or indirectly, of
activating the receptors. The invention also provides nucleic
acid probes for identifying DNAs which encode additional
retinoid receptors of the same class as the novel receptors
disclosed herein.

RETINOID RECEPTOR COMPOSITIONS AND METHODS

RELATED APPLICATIONS

This application is a continuation-in-part of
application Serial No. 478,071, filed February 9, 1990,
now pending, the entire contents of which are hereby
incorporated by reference herein.

TECHNICAL FIELD 

The present invention concerns novel, steroid 
hormone-like receptor proteins and methods of making and 
using same. 

More particularly, the invention relates to steroid
hormone-like receptor proteins with
transcription-modulating effects. Such proteins are
responsive to the presence of retinoic acid and other
vitamin A metabolites.

BACKGROUND OF THE INVENTION 

The retinoids comprise a group of compounds
including retinoic acid, retinol (vitamin A), and a
series of natural and synthetic derivatives that together
exert profound effects on development and differentiation
in a wide variety of systems. Although early studies
focused on the effects of retinoids on growth and
differentiation of epithelial cells, their actions have
been shown to be widespread. Many recent studies have
examined the effects of these molecules on a variety of
cultured neoplastic cell types, including the human
promyelocytic leukemia cell line, HL60, where retinoic
acid appears to be a potent inducer of granulocyte
differentiation. In F9 embryonal carcinoma cells,
retinoic acid will induce the differentiation of parietal
endoderm, characteristic of a late mouse blastocyst.
Retinoic acid also appears to play an important role in
defining spatio-temporal axes in the developing avian
limb and the regenerating amphibian limb.

Retinoic acid has been shown to induce the
transcription of several cDNAs whose gene products have
been isolated by differential screening. This
observation supports the hypothesis that retinoic acid
exerts its action via modulation of gene expression, in a
manner analogous to the way in which steroid and thyroid
hormones influence their target genes.

The ability to identify compounds which affect
transcription of genes which are responsive to retinoic
acid or other metabolites of vitamin A would be of
significant value, e.g., for therapeutic applications.
Further, systems useful for monitoring solutions, body
fluids and the like for the presence of retinoic acid,
vitamin A or metabolites of the latter would be of value
in various analytical biochemical applications and,
potentially, medical diagnosis.

Through molecular cloning studies it has been
possible to demonstrate that receptors for steroid,
retinoid and thyroid hormones are all structurally
related. These receptors comprise a superfamily of
regulatory proteins that are capable of modulating
specific gene expression in response to hormone
stimulation by binding directly to cis-acting elements
(Evans, Science 240, 889 (1988); Green and Chambon,
Trends Genet. 4, 309 (1988)). Structural comparisons and
functional studies with mutant receptors have established
that these molecules are composed of discrete functional
domains, most notably, a DNA-binding domain that is
composed typically of 66 - 68 amino acids (including two
zinc fingers), and an associated carboxy terminal stretch
of approximately 250 amino acids which comprises the
ligand-binding domain (reviewed in Evans, supra).

Low-stringency hybridization has permitted the
isolation and subsequent delineation of a growing list of
gene products which possess the structural features of
hormone receptors.

Recently, a retinoic acid dependent transcription
factor, referred to as RAR-alpha (retinoic acid
receptor-alpha), has been identified. Subsequently, two
additional RAR-related genes have been isolated; thus
there are now at least three different RAR subtypes
(alpha, beta and gamma) known to exist in mice and
humans. These retinoic acid receptors (RARs) share
homology with the superfamily of steroid hormone and
thyroid hormone receptors and have been shown to regulate
specific gene expression by a similar ligand-dependent
mechanism (Umesono et al., Nature 336, 262 (1988)).
These RAR subtypes are expressed in distinct patterns
throughout development and in the mature organism.

Other information helpful in the understanding and
practice of the present invention can be found in
commonly assigned, co-pending United States Patent
Application Serial Nos. 108,471, filed October 20, 1987;
276,536, filed November 30, 1988; 325,240, filed March
17, 1989; 370,407, filed June 22, 1989; and 438,757,
filed November 16, 1989, all of which are hereby
incorporated herein by reference in their entirety.

SUMMARY OF THE INVENTION 

We have discovered novel receptors which are
activated to modulate transcription of certain genes in
animal cells, when the cells are exposed to retinoids,
such as retinoic acid and retinal. The novel receptors
differ significantly from known retinoic acid receptors,
both in terms of the primary protein sequence and
responsiveness to various retinoids.

The novel receptors have several isoforms located at
genetically distinct loci. They are capable of
transactivating through cis elements similar to retinoic
acid receptors, but show a different rank potency and
dose dependency to retinoids. Northern analyses of the
novel receptors of the present invention indicate that
each isoform has a unique pattern of expression in adult
tissue and is temporally and spatially expressed in the
embryo. Binding experiments demonstrate that the novel
receptor proteins have a low affinity for [3H]retinoic
acid. These results, taken together with results from
transactivation studies, suggest the ligand(s) for the
novel receptors is a metabolite(s) or structural
analog(s) of retinoic acid. The invention provides DNAs
encoding novel receptors, expression vectors for
expression of the receptors, cells transformed with such
expression vectors, cells co-transformed with such
expression vectors and reporter vectors to monitor
modulation of transcription by the receptors, and methods
of using such co-transformed cells in screening for
compounds which are capable, directly or indirectly, of
activating the receptors.

The invention also provides single-stranded nucleic 
acid probes for identifying DNAs encoding additional 
retinoid receptors. 

The invention also provides a method for making the
receptors of the invention by expressing DNAs which
encode the receptors in suitable host organisms.

Animal cells in which receptors of the invention are
present can be employed to assay fluids for the presence
of retinoids. Animal cells of the invention can also be
employed to screen compounds of potential therapeutic
value for their ability to bind and/or promote
trans-activation (i.e., trans-acting transcriptional
activation) by the receptors of the invention.

As will be described in greater detail below, the
receptors of the invention modulate transcription of
genes. This occurs upon binding of receptor to hormone
response elements, which are positioned operatively, with
respect to promoters for such genes, for such modulation
to occur. Among hormone response elements contemplated
for use in the practice of the present invention are TREp,
the beta-retinoic acid response element, and the estrogen
response element, as well as closely related elements
which are disclosed, for example, in Application Serial
No. 438,757, filed November 16, 1989, and Application
Serial No. 325,240, filed March 17, 1989.

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows the extent of amino acid identity
(i.e., "homology") between the DNA binding domain ("DNA")
and ligand binding domain ("RX") of mouse RXR-alpha
(mRXRα), relative to the corresponding domains of human
retinoic acid receptor-alpha (hRARα), human estrogen
receptor (hER), human thyroid hormone receptor-beta
(hTRβ) and human glucocorticoid receptor (hGR).

Figure 2 shows the extent of amino acid identity
(i.e., "homology") between the DNA binding domain ("DNA")
and ligand binding domain ("LIGAND") of human RAR-alpha
(hRARα), relative to the corresponding domains of human
retinoic acid receptor-beta (hRARβ), human retinoic acid
receptor-gamma (hRARγ), hTRβ and hRXRα.

Figure 3 shows the extent of amino acid identity
(i.e., "homology") between the DNA binding domain ("DNA")
and ligand binding domain ("RX") of mRXRα, relative to
the corresponding domains of mouse RXR-beta (mRXRβ),
mouse RXR-gamma (mRXRγ) and hRXRα.

Figure 4 illustrates the production of CAT from the
reporter vector (ADH-TREp-CAT) in Drosophila melanogaster
Schneider line 2 cells, which are co-transformed with
receptor expression vector A5C-RXR-alpha and are in a
medium containing various concentrations of retinoic
acid.

Figure 5 illustrates the differences in
transcription-activating activities of hRXR-alpha and
hRAR-alpha, in mammalian cells in culture containing
different vitamin A metabolites.

Figure 6, like Figure 5, illustrates the differences
in transcription-activating activities of hRXR-alpha and
hRAR-alpha in mammalian cells in culture containing
retinoic acid or different synthetic retinoids.

Figure 7 illustrates the differences between
hRXR-alpha and hRAR-alpha in dose-response to retinoic
acid in media bathing mammalian cells in which the
receptors occur.

Figure 8 illustrates the differences between
mouse RXR-alpha (mRXRα), mouse RXR-beta (mRXRβ) and mouse
RXR-gamma (mRXRγ) in dose response to retinoic acid (RA)
in media bathing mammalian cells expressing such
receptors.

Figure 9 illustrates the differences between mRXRα,
mRXRβ and mRXRγ in dose response to 3,4-didehydroretinoic
acid (ddRA) in media bathing mammalian cells expressing
such receptors.

DETAILED DESCRIPTION OF THE INVENTION

The invention concerns novel polypeptides, which are
characterized by:

(1) being responsive to the presence of retinoid(s)
    to regulate transcription of associated
    gene(s);

(2) having a DNA binding domain of about 66 amino
    acids with 9 Cys residues, wherein said DNA
    binding domain has:

    (a) less than about 65 % amino acid identity
        with the DNA binding domain of hRAR-alpha,

    (b) less than about 55 % amino acid identity
        with the DNA binding domain of hTR-beta,
        and

    (c) less than about 55 % amino acid identity
        with the DNA binding domain of hGR; and

(3) not including the sequence set forth in
    Sequence ID No. 7.
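
The identity limits in item (2) can be checked mechanically once two DNA binding domains have been aligned. The following is a minimal Python sketch and not part of the disclosed methods: the function names are hypothetical, the alignment is assumed to have been produced beforehand (for instance by an alignment program), and identity is counted over aligned, non-gap columns, which is only one of several conventions. The word "about" in the limits is not modelled; the comparison is a strict less-than.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned, non-gap columns whose residues are identical."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore any column in which either sequence has a gap character.
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


# Upper limits (percent identity) taken from items (2)(a) to (2)(c).
IDENTITY_LIMITS = {"hRAR-alpha": 65.0, "hTR-beta": 55.0, "hGR": 55.0}


def meets_identity_criteria(identities: dict) -> bool:
    """True if every measured identity is below its stated limit."""
    return all(identities[name] < limit for name, limit in IDENTITY_LIMITS.items())
```

A caller would compute one `percent_identity` value per reference receptor and pass the three values, keyed by receptor name, to `meets_identity_criteria`.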

The novel polypeptide receptors of the present
invention can be further characterized in a variety of
ways, e.g., by increasing the rate of transcription of a
target gene in a construct comprising a promoter
operatively linked to a hormone response element for
transcriptional activation by said receptors, relative to
the rate of transcription in the absence of said receptor
and/or in the absence of retinoic acid and retinal.
Transcription of said target gene is measured in an
animal cell in culture, the medium of which comprises
retinoic acid or retinal at a concentration greater than
about 5 x 10^-7 M.

Alternatively, the polypeptide receptors of the
present invention can be further characterized as being
encoded by a continuous nucleotide sequence which encodes
substantially the same amino acid sequence as that of
amino acids 1-462 shown in Sequence ID No. 2 [hRXRα],
amino acids 1-467 shown in Sequence ID No. 4 [mRXRα], or
amino acids 1-463 shown in Sequence ID No. 6 [mRXRγ].

As yet another alternative, the polypeptide
receptors of the present invention can be characterized
as being encoded by a continuous nucleotide sequence
which encodes substantially the same amino acid sequence
as that of amino acids 135-200 shown in Sequence ID No. 2
[DNA binding domain of hRXRα], amino acids 140-205 shown
in Sequence ID No. 4 [DNA binding domain of mRXRα], or
amino acids 139-204 shown in Sequence ID No. 6 [DNA
binding domain of mRXRγ].

As still another alternative, the polypeptide
receptor of the present invention can be characterized as
being encoded by a continuous nucleotide sequence which
is substantially the same as nucleotides 76-1464 shown in
Sequence ID No. 1 [hRXRα], nucleotides 181-1581 shown in
Sequence ID No. 3 [mRXRα], or nucleotides 123-1514 shown
in Sequence ID No. 5 [mRXRγ].

As employed herein, the term "retinoids" refers to
naturally occurring compounds with vitamin A activity,
synthetic analogs and various metabolites thereof. The
retinoids are a class of compounds consisting of four
isoprenoid units joined in head-to-tail manner.
Numerous retinoids have been identified, as described,
for example, by Sporn, Roberts and Goodman in
the two volume treatise entitled The Retinoids (Academic
Press, NY, 1984), to which the reader is directed for
further detail. Exemplary retinoids include retinol,
retinyl acetate, retinyl hexadecanoate, o-retinyl,
4,14-retroretinol, deoxyretinol, anhydroretinol,
3,4-didehydroretinol, 15,15-dimethyl retinol, retinyl
methyl ether, retinyl phosphate, mannosyl retinyl
phosphate, retinol thioacetate, retinal (retinaldehyde),
3,4-didehydroretinal, retinylidene acetylacetone,
retinylidene-1,3-cyclopentanedione, retinal oxime,
retinaldehyde acetylhydrazone, retinoic acid,
4-hydroxyretinoic acid, 4-oxoretinoic acid,
5,6-dihydroretinoic acid, 5,6-epoxyretinoic acid,
5,8-epoxyretinoic acid, the open-chain analog of
retinoic acid (i.e.,
(all-E)-3,7,11,15-tetramethyl-2,4,6,8,10,12,14-hexadecaheptaenoic
acid), 7,8-didehydroretinoic acid, 7,8-dihydroretinoic
acid, "C15 Acid"
((E,E)-3-methyl-5-(2,6,6-trimethyl-2-cyclohexen-1-yl)-2,4-pentadienoic
acid), "C17 Acid"
((E,E,E)-5-methyl-7-(2,6,6-trimethyl-1-cyclohexen-1-yl)-2,4,6-heptatrienoic
acid), "C22 Acid" (14'-apo-β,ψ-carotenoic acid), retinoic
acid esters (e.g., methyl ester, ethyl ester, etc.),
retinoic acid ethylamide, retinoic acid
2-hydroxyethylamide, methyl retinone, "C18 Ketone"
((E,E,E)-6-methyl-8-(2,6,6-trimethyl-1-cyclohexen-1-yl)-3,5,7-octatrien-2-one),
and the like.

In addition, according to the present invention, 
there are provided DNA sequences which encode novel 
polypeptides as described above. 

Further in accordance with the present invention,
there are provided DNA constructs which are operative in
animal cells in culture to make said polypeptides.

According to a still further embodiment of the
present invention, there are provided animal cells in
culture which are transformed with DNA constructs (as
described above), which are operative in said cells to
make receptor polypeptides, by expression of DNA segments
which encode the above described polypeptides.

Among the animal cells contemplated for use in the
practice of the present invention are those which are
further transformed with a reporter vector which
comprises:

(a) a promoter that is operable in the cell,

(b) a hormone response element, and

(c) a DNA segment encoding a reporter protein,
    wherein said reporter protein-encoding DNA
    segment is operatively linked to said promoter
    for transcription of said DNA segment, and

    wherein said hormone response element is
    operatively linked to said promoter for
    activation thereof.

In accordance with the present invention, there is
also provided a method of testing a compound for its
ability to regulate the transcription-activating
properties of the above-described receptor polypeptides,
which method comprises assaying for the presence or
absence of reporter protein upon contacting of cells
containing a reporter vector and receptor polypeptide
with said compound; wherein said reporter vector and said
receptor polypeptide are as described above.

In accordance with a still further embodiment of the
present invention, there are provided various probes,
which can be used to identify genes encoding receptors
related to those of the present invention. In this
regard, particular reference is made to Examples V and VI
below. More particularly, the invention provides
labeled, single-stranded nucleic acids comprising
sequences of at least 20 contiguous bases having
substantially the same sequence as any 20 or more
contiguous bases selected from:

(i) bases 2 - 1861, inclusive, of the DNA
    illustrated in Sequence ID No. 1 [hRXR-α],

(ii) bases 20 - 2095, inclusive, of the DNA
    illustrated in Sequence ID No. 2 [mRXR-α],
    or

(iii) bases 15 - 1653, inclusive, of the DNA
    illustrated in Sequence ID No. 3 [mRXR-γ],
    or

(iv) the complement of any one of the sequences
    according to (i), (ii), or (iii).
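
The probe definition above, at least 20 contiguous bases matching a listed sequence or its complement, can be illustrated with a short Python sketch. This is an illustration only: the function names are hypothetical, any sequence passed in would be supplied by the reader, and "substantially the same sequence" is simplified here to an exact match. The complement is handled as the reverse complement, i.e. the opposite strand read 5' to 3'.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the opposite strand of a DNA sequence, read 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def shares_contiguous_run(probe: str, target: str, min_len: int = 20) -> bool:
    """True if probe and target (either strand) share min_len contiguous bases.

    Exact matching is a simplification of "substantially the same sequence".
    """
    probe = probe.upper()
    strands = (target.upper(), reverse_complement(target))
    # Slide a window of min_len bases along the probe and look for it on each strand.
    for start in range(len(probe) - min_len + 1):
        window = probe[start:start + min_len]
        if any(window in strand for strand in strands):
            return True
    return False
```

A probe shorter than `min_len` never qualifies, because the window loop does not run at all in that case.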

As employed herein, the term "labeled
single-stranded nucleic acid sequences" refers to
single-stranded DNA or RNA sequences which have been
modified by the addition thereto of a species which
renders the "labeled" sequence readily detectable from
among other unmodified sequences. Exemplary labels
include radioactive label (e.g., 32P, 35S), enzymatic
label (e.g., biotin), and the like.

Preferred probes contemplated for use in the
practice of the present invention are those having at
least about 100 contiguous bases selected from the
above-described sequences. Especially preferred are
probes having in the range of about 198 up to several
hundred nucleotides, because greater selectivity is
afforded by longer sequences.

The invention also encompasses a method of making
the above-described receptor polypeptides, which method
comprises culturing suitable host cells which are
transformed with an expression vector operable in said
cells to express DNA which encodes receptor polypeptide.
Suitable hosts contemplated for use in the practice of
the present invention include yeast, bacteria, mammalian
cells, insect cells, and the like. E. coli is the
presently preferred bacterial species. Any of a number
of expression vectors are well known to those skilled in
the art that could be employed in the method of the
invention. Among these are the prokaryotic expression
vectors pNH8A, pNH16A and pNH18A available from
Stratagene, La Jolla, California USA.

Further information on the invention is provided in
the following non-limiting examples and description of an
exemplary deposit.

EXAMPLES

Example I

The KpnI/SacI restriction fragment (503 bp) including
the DNA-binding domain of hRAR-alpha-encoding DNA [See
Giguere et al., Nature 330, 624 (1987); and commonly
assigned United States Patent Application Serial No.
276,536, filed November 30, 1988; and European
Patent Application Publication No. 0 325 849, all
incorporated herein by reference] was nick-translated and
used to screen a lambda-gt11 human liver cDNA library
(Kwok et al., Biochem. 24, 556 (1985)) at low stringency.
The hybridization mixture contained 35% formamide, 1X
Denhardt's, 5X SSPE (1X SSPE = 0.15 M NaCl, 10 mM Na2HPO4,
1 mM EDTA), 0.1% SDS, 10% dextran sulfate, 100 mg/ml
denatured salmon sperm DNA and 10* cpm of (32P)-labelled
probe. Duplicate nitrocellulose filters were hybridized
for 16 h at 42°C, washed once at 25°C for 15 min with 2X
SSC (1X SSC = 0.15 M NaCl, 0.015 M sodium citrate), 0.1%
SDS and then washed twice at 55°C for 30 min in 2X SSC,
0.1% SDS. The filters were autoradiographed for 3 days at
-70°C using an intensifying screen.

Positive clones were isolated, subcloned into pGEM
vectors (Promega, Madison, Wisconsin, USA), restriction
mapped, and re-subcloned in various sized restriction
fragments into M13mp18 and M13mp19 sequencing vectors.
DNA sequence was determined by the dideoxy method with
Sequenase sequencing kit (United States Biochemical,
Cleveland, Ohio, USA) and analyzed by University of
Wisconsin Genetics Computer Group programs (Devereux
et al., Nucl. Acids Res. 12, 387 (1984)). A unique
receptor-like sequence was identified and designated
lambda-HL3-1.

Lambda-HL3-1 was used as a hybridization probe to
rescreen a lambda-gt10 human kidney cDNA library (Arriza
et al., Science 237, 268 (1987)) which produced several
clones, the longest of which was sequenced and designated
lambda-XR3-1. The DNA sequence obtained as an EcoRI
fragment from lambda-XR3-1 has the sequence indicated in
Sequence ID No. 1 [hRXRα].

Similar screening of a mouse whole embryo library 
with the full-length hRXR-alpha clone described above 
10 provided additional sequences which encode different 
isoforms of the human RXR-alpha receptor. In addition, 
the mouse homolog (mouse RXR-alpha) was also identified 
in this way. 

Thus, mRNA was isolated from 14.5 day post-coitus
(p.c.) mouse embryos, translated into cDNA, linkered with
EcoRI/NotI linkers, then inserted into the unique EcoRI
site of the cloning vector λ-ZAP (Stratagene). The
resulting library was screened at reduced stringency with
32P-labeled, full length hRXR-alpha as the probe.

The DNA sequences of the resulting clones are set
forth as Sequence ID No. 3 [mRXRα] and Sequence ID No. 5
[mRXRγ].

Example II 

Amino acid sequences of mRXR-alpha, hRAR-alpha
(human retinoic acid receptor-alpha), hER (human estrogen
receptor), hTR-beta (human thyroid hormone receptor-beta)
and hGR (human glucocorticoid receptor) were aligned
using the University of Wisconsin Genetics Computer Group
program "Bestfit" (Devereux et al., supra). Regions of
significant similarity between mRXR-alpha and the other
receptors, i.e., the 66 - 68 amino acid DNA binding
domains and the ligand-binding domains, are presented
schematically in Figure 1 as percent amino acid identity.

Similarly, the amino acid sequences of human
RAR-alpha (hRARα), human RAR-beta (hRARβ), human
RAR-gamma (hRARγ), human TR-beta (hTRβ) and human
RXR-alpha (hRXRα) were aligned. As done in Figure 1,
regions of significant similarity between hRAR-alpha and
the other receptors are presented schematically in
Figure 2 as percent amino acid identity.

A further comparison of receptors is set forth in
Figure 3. Thus, the amino acid sequences of mouse
RXR-alpha (mRXRα), mouse RXR-beta (mRXRβ), mouse
RXR-gamma (mRXRγ) and human RXR-alpha (hRXRα) were
aligned, and the percent amino acid identity presented
schematically in Figure 3.

Although the DNA-binding domains of both mRXR-alpha
and hRXR-alpha are conserved relatively well with respect
to other receptors (such as hRAR-alpha and hTR-beta), the
ligand binding domain is poorly conserved. (See Figures
1 and 3). A comparison between the retinoic acid
receptor subfamily of receptors and hRXR-alpha reveals
nothing to suggest that hRXR-alpha is related to any of
the known retinoid receptors (Fig. 2).

Example III

Drosophila melanogaster Schneider line 2 ("S2")
cells (Schneider, Embryol. Exp. Morphol. 27, 353 (1972)),
which are readily available, were seeded at 2 x 10^6 per
35 mm^2 culture dish and maintained in Schneider medium
(GIBCO/Life Technologies, Inc., Grand Island, New York,
USA) supplemented with penicillin, streptomycin and 12%
heat-inactivated fetal bovine serum (Irvine Scientific,
Santa Ana, California, USA). The cells were transiently
co-transfected with 10 μg/dish of plasmid DNA by calcium
phosphate precipitation (Krasnow et al., Cell 57, 1031
(1989)): 4.5 μg/dish of receptor expression vector or
control construct (producing no hRXR-alpha); 0.5 μg/dish
of reporter plasmid or control reporter plasmid; 0.5
μg/dish of reference plasmid; and 4.5 μg inert plasmid
DNA.
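
The four plasmid amounts listed above are meant to add up to the stated 10 μg of DNA per dish. A small Python sketch of that bookkeeping follows; the dictionary keys and function names are illustrative labels chosen here, not terms from the protocol, and the scaling helper simply multiplies each amount by a number of dishes.

```python
# Per-dish DNA amounts in micrograms, as listed in the paragraph above.
DNA_PER_DISH_UG = {
    "receptor expression vector or control construct": 4.5,
    "reporter plasmid or control reporter plasmid": 0.5,
    "reference plasmid": 0.5,
    "inert plasmid DNA": 4.5,
}


def total_dna_ug(mix: dict) -> float:
    """Total DNA mass per dish, in micrograms."""
    return sum(mix.values())


def scale_mix(mix: dict, n_dishes: int) -> dict:
    """Amount of each plasmid needed for n_dishes identical dishes."""
    return {name: ug * n_dishes for name, ug in mix.items()}
```

Checking that `total_dna_ug(DNA_PER_DISH_UG)` equals 10.0 is a quick guard against transcription errors in the individual amounts.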

In the receptor expression vector, A5C-RXR-alpha
(4.5 μg/dish), receptor hRXR-alpha is constitutively
expressed in the S2 cells under the control of the
Drosophila actin 5C promoter (A5C; Thummel et al., Gene
74: 445 (1988)) driving transcription of the
EcoRI-site-bounded insert of lambda-XR3-1. In the control
vector, A5C-RXRrev (also 4.5 μg/ml), the
EcoRI-site-bounded insert from lambda-XR3-1 is inserted in
the reverse (i.e., non-coding or non-sense-coding)
orientation.

A5C-RXR-alpha was made by first inserting at the
unique BamHI site of A5C a linker of sequence:

    5'-GATCCGATATCCATATGGAATTCGGTACCA,

and then inserting, at the EcoRI site of the linker
(GAATTC in the sequence above), the EcoRI-site-bounded
insert of lambda-XR3-1 (See Example I).
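
The position of the EcoRI site within the linker can be located by a simple string search. The Python sketch below is illustrative only. The text names only the EcoRI site; the other three entries (EcoRV, NdeI, KpnI) are standard recognition sequences added here as an assumption about what else the linker contains, and the reported positions are 0-based offsets from the 5' end.

```python
# Linker sequence as given above, written 5' to 3'.
LINKER = "GATCCGATATCCATATGGAATTCGGTACCA"

# Recognition sequences. Only EcoRI is named in the text; the others are
# well-known sites included here for illustration.
SITES = {
    "EcoRV": "GATATC",
    "NdeI": "CATATG",
    "EcoRI": "GAATTC",
    "KpnI": "GGTACC",
}


def find_sites(seq: str, sites: dict) -> dict:
    """Map each enzyme name to the 0-based offset of its first site, or -1 if absent."""
    seq = seq.upper()
    return {name: seq.find(site) for name, site in sites.items()}


positions = find_sites(LINKER, SITES)
```

With these inputs the EcoRI site begins at offset 17, so an insert cloned there sits downstream of the other sites in the linker.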

The reporter plasmid ADH-TREp-CAT (at 0.5 μg/dish)
contains the palindromic thyroid hormone response element
TREp, having the sequence:

    5'-AGGTCATGACCT

[Glass et al., Cell 54, 313 (1988); Thompson and Evans,
Proc. Natl. Acad. Sci. (USA) 86, 3494 (1989)], inserted
into position -33 (with respect to the transcription start
site) of a pD33-ADH-CAT background (Krasnow et al., Cell
57, 1031 (1989)).
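
"Palindromic" for a double-stranded DNA element means that the sequence reads the same 5' to 3' on both strands, i.e. it equals its own reverse complement. A minimal Python check for the TREp sequence quoted above is sketched here; the function names are hypothetical and chosen for this illustration.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# TREp sequence as quoted above, written 5' to 3'.
TREP = "AGGTCATGACCT"


def reverse_complement(seq: str) -> str:
    """Return the opposite strand of a DNA sequence, read 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_palindromic(seq: str) -> bool:
    """True if a DNA sequence equals its own reverse complement."""
    return seq.upper() == reverse_complement(seq)
```

The half-site AGGTCA on its own is not palindromic; the symmetry comes from pairing it with its inverted copy TGACCT.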

pD33-ADH-CAT is a plasmid with the distal promoter of
the Drosophila melanogaster alcohol dehydrogenase gene
linked operably for transcription to the bacterial (E.
coli) chloramphenicol acetyltransferase ("CAT") gene, a
gene for the indicator protein CAT. ADH-TREp-CAT was made
by inserting the oligonucleotide of sequence:

    5'-CTAGAGGTCATGACCT
           TCCAGTACTGGAGATC-5'

into the XbaI site at position -33 in pD33-ADH-CAT.
pD33-ADH-CAT, without TREp, served as a control reporter
(i.e., background) plasmid.

A reference plasmid encoding beta-galactosidase
driven by the actin 5C promoter also was transfected (0.5
μg/dish) along with pGEM DNA (4.5 μg/dish) (Promega,
Madison, Wisconsin) to make up the final DNA concentration
to 10 μg/dish. The reference plasmid was made by
inserting a BamHI-site bounded, beta-galactosidase-encoding
segment into the unique BamHI site of A5C. The
purpose of the reference plasmid was to normalize results
for transfection efficiency.

Twenty-four hours post-transfection, various
retinoids were added to the cultures. The retinoids were
dissolved in dimethyl-sulfoxide and/or ethanol and the
resulting solution was added to 0.1 % v/v of culture
medium. Initial concentration of the retinoids in the
culture media was 10^-6 M, except for the experiments
underlying the data displayed in Figure 4, for which
varying concentrations of retinoic acid were used.

In control runs, ethanol, at 0.1 % v/v in the medium,
was used in place of a solution of retinoid.

Cultures were maintained in the dark for 36 hr after
addition of retinoid and then harvested. All other parts
of the experiments, involving retinoids, were carried out
in subdued light.

Cell lysates were centrifuged. Supernatants were
assayed for beta-galactosidase, following Herbomel et al.,
Cell 39, 653-662 (1984), and units/ml of
beta-galactosidase activity was calculated. CAT assays
(normalized to beta-galactosidase activity) of
supernatants were incubated for 75 unit-hours ("units"
referring to units of beta-galactosidase activity), as
described by Gorman et al., Mol. Cell. Biol. 2, 1044
(1982), usually 150 units for 30 minutes.
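
The normalization described above fixes the product of beta-galactosidase units and incubation time at 75 unit-hours, so the incubation time follows directly from the number of units loaded. The Python sketch below illustrates that arithmetic; the function and constant names are hypothetical and are not taken from the cited protocols.

```python
# Target exposure stated in the text: 75 unit-hours of beta-galactosidase activity.
TARGET_UNIT_HOURS = 75.0


def incubation_hours(beta_gal_units: float,
                     target_unit_hours: float = TARGET_UNIT_HOURS) -> float:
    """Incubation time (hours) that gives the target unit-hours for a sample."""
    if beta_gal_units <= 0:
        raise ValueError("beta-galactosidase units must be positive")
    return target_unit_hours / beta_gal_units
```

For the usual case quoted in the text, 150 units gives 0.5 hours, i.e. 30 minutes.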

No hRXR-alpha dependent activation of CAT expression
was noted in any experiment in which control reporter was
used in place of ADH-TREp-CAT. Similarly, essentially no
activation was observed for runs where control plasmid,
A5C-hRXRrev, was used in place of A5C-hRXR.

The induction of CAT activity in retinoid-treated
cells was compared with induction in untreated (i.e., only
ethanol-treated) cells. Induction was measured in the
presence of retinoic acid (RA), retinal (RAL), retinol
acetate (RAC), retinol (ROH), and retinol palmitate (RP).
The production of chloramphenicol acetyltransferase (CAT)
from the reporter vector (ADH-TREp-CAT) was measured in
Drosophila melanogaster Schneider line 2 cells,
co-transformed with the hRXR-alpha expression vector
A5C-RXR-alpha, and exposed to a medium to which retinoic
acid (RA), retinal (RAL), retinol acetate (RAC), retinol
(ROH), or retinol palmitate (RP) has been added to a
concentration of 10^-6 M. The relative induction observed
was RA > RAL > RAC > ROH > RP.

In Figure 4 are displayed the results, also expressed
in terms of "fold-induction" of CAT activity, as described
in the previous paragraph, with retinoic acid at a number
of different concentrations, to show the "dose response"
of hRXR-alpha (in trans-activation at TREp in insect
cells) to retinoic acid in the medium of the cells.
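
"Fold-induction" as used here is the ratio of normalized CAT activity in retinoid-treated cells to that in ethanol-only cells. The Python sketch below shows that calculation for a dose series. All numerical values in it are hypothetical placeholders chosen for illustration, not data from Figure 4, and the function name is an assumption.

```python
def fold_induction(treated_cat: float, control_cat: float) -> float:
    """Ratio of normalized CAT activity in treated cells to ethanol-only cells."""
    if control_cat <= 0:
        raise ValueError("control CAT activity must be positive")
    return treated_cat / control_cat


# Hypothetical normalized CAT activities (arbitrary units), keyed by the
# molar concentration of retinoic acid in the medium. Not measured data.
hypothetical_series = {1e-9: 1.2, 1e-8: 2.0, 1e-7: 6.0, 1e-6: 15.0}
hypothetical_ethanol_only = 1.0

# Dose-response curve: concentration -> fold-induction, in ascending dose order.
dose_response = {
    conc: fold_induction(activity, hypothetical_ethanol_only)
    for conc, activity in sorted(hypothetical_series.items())
}
```

Plotting `dose_response` with concentration on a logarithmic axis would give a curve of the kind the paragraph describes.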

Example IV 

This example, describing experiments similar to those 
described in Example III, shows that hRAR-alpha and hRXR- 
alpha differ significantly in their properties, 
specifically with respect to trans-activation of 
transcription from promoters. 

The mammalian receptor-expression vector
RS-hRAR-alpha, from which hRAR-alpha is produced under
control of the 5'-LTR promoter of the Rous sarcoma virus
proviral DNA, is described in Giguere et al., Nature 330,
624 (1987); commonly assigned United States Patent
Application Serial No. 276,536, filed November 30, 1988;
and European Patent Application Publication No. 0 325 849,
all incorporated herein by reference.

The receptor-expression vector RS-hRXR-alpha is
constructed similarly to RS-hRAR-alpha, by inserting the
EcoRI-site-bounded, hRXR-alpha-encoding segment of
lambda-XR3-1 into plasmid pRS (Giguere et al., Cell 46,
645 (1986)).

Control plasmid pRSns is pRS with a non-sense-coding 
sequence inserted in place of receptor-coding sequence. 

Reporter plasmid delta-MTV-TREp-CAT, also known as
TREp1MCAT, has also been described (Umesono et al., Nature
336, 262 (1988); Thompson and Evans, supra; see also
Umesono and Evans, Cell 57, 1139 (1989)). When a control
reporter, designated delta-MTV-CAT, which is substantially
delta-MTV-TREp-CAT with TREp removed, was used in place of
delta-MTV-TREp-CAT, no CAT activity was found with either
receptor with any of the retinoids or retinoid analogs.

Reference plasmid, RS-beta-galactosidase, is also 
known and is substantially the same as RS-hRAR-alpha and 
RS-hRXR-alpha but has a beta-galactosidase-encoding 
segment in place of the receptor-encoding segment. 

Culture of CV-1 cells, co-transfections (with
reporter plasmid, receptor-expression plasmid or control
plasmid, reference plasmid and inert plasmid DNA) and CAT
assays were performed as described in Umesono et al.,
Nature 336, 262 (1988). Co-transfections and CAT assays
were carried out by methods similar to those described in
Example III. As in the experiments of Example III,
subdued light was used.

When CV-1 cells co-transformed with reporter plasmid
(delta-MTV-TREp-CAT), reference plasmid, control plasmid
(i.e., expressing no receptor), and receptor plasmid
(RS-hRAR-alpha or RS-hRXR-alpha) were exposed to
retinoids RA, RAL, RAC, ROH, RP (which are naturally
occurring vitamin A metabolites), or retinoid-free
ethanol, the results shown in Figure 5 were obtained. The
Figure illustrates production of CAT from reporter plasmid
in monkey kidney cells of the CV-1 line, which are
co-transformed with hRXR-alpha-producing expression vector
RS-hRXR-alpha or hRAR-alpha-producing expression vector
RS-hRAR. Experiments are carried out in a medium to which
RA, RAL, RAC, ROH, or RP has been added to a concentration
of 10⁻⁶ M. The bars over the "-" sign indicate the levels
of CAT production when the cells are exposed to no
retinoid (i.e., retinoid-free ethanol). The hatched bars
indicate the level of CAT production when a control
expression vector, from which no receptor is expressed, is
employed in place of the receptor expression vector. The
open bars indicate the level of CAT production when
receptor-producing expression vector is employed. In
each case, the retinoids were added as ethanolic
solutions, with the volume of solution 0.1% (v/v) in the
medium. Retinoid-free ethanol was added to 0.1% (v/v).
Results are plotted as percentages of the maximal response
observed in the experiments, i.e., hRXR-alpha with RA.
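The scaling to "percentages of the maximal response" can be illustrated with a short calculation. The following is only a sketch: the function name `percent_of_max` and the numerical readings are hypothetical and are not taken from Figure 5.

```python
def percent_of_max(cat_activity):
    """Rescale raw CAT readings so that the largest reading is 100%."""
    peak = max(cat_activity.values())
    if peak <= 0:
        raise ValueError("maximal CAT activity must be positive")
    return {key: 100.0 * value / peak for key, value in cat_activity.items()}


# Hypothetical raw readings (arbitrary units), keyed by (receptor, retinoid).
readings = {
    ("hRXR-alpha", "RA"): 8.0,
    ("hRAR-alpha", "RA"): 6.0,
    ("hRXR-alpha", "ethanol"): 2.0,
}

scaled = percent_of_max(readings)
print(scaled[("hRXR-alpha", "RA")])  # 100.0, the maximal response
print(scaled[("hRAR-alpha", "RA")])  # 75.0
```

With this scaling, the condition giving the strongest response (here assumed to be hRXR-alpha with RA, as stated above) is plotted at 100 and every other bar is read relative to it.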

In Figure 6, the results are provided for experiments
carried out as described in the previous paragraph but
with, in place of RAL, RAC, ROH and RP, the synthetic
retinoids 4-(5,6,7,8-tetrahydro-5,5,8,8-tetramethyl-4-
iodo-2-anthracenyl)-benzoic acid ("R1"), ethyl-p-[(E)-2-
(5,6,7,8-tetrahydro-5,5,8,8-tetramethyl-2-naphthyl)-1-
propenyl]-benzoic acid ("R2"), ethyl-all trans-9-(4-
methoxy-2,3,6-trimethyl)-3,7-dimethyl-2,4,6,8-
nonatetraenoate ("R3"), and ethyl-all trans-9-(4-methoxy-
2,3,6-trimethyl)-3,7-dimethyl-2,4,6,8-nonatetraenoic acid
("R4"), initially at a concentration of 10⁻⁶ M. The Figure
illustrates production of CAT from the reporter plasmid
(delta-MTV-TREp-CAT) in CV-1 cells, which are co-transformed
with hRXR-alpha-producing expression vector RS-hRXR-alpha
or the constitutive hRAR-alpha-producing expression vector
RS-hRAR. Experiments are carried out in a medium to which
RA, R1, R2, R3, or R4 has been added to a concentration of
10⁻⁶ M. The bars over the "-" sign indicate the levels of
CAT production when the cells are exposed to no retinoid.
The hatched bars indicate the level of CAT production when
a control expression vector, from which no receptor is
expressed, is employed in place of the receptor expression
vector. The open bars indicate the level of CAT
production when receptor-producing expression vector is
employed.

In Figure 7, results are presented for experiments
carried out as described in this Example using various
concentrations of retinoic acid. The Figure illustrates
production of CAT from the reporter plasmid
(delta-MTV-TREp-CAT) in CV-1 cells, which are co-transformed
with the receptor-producing expression vector RS-RXR-alpha
or RS-RAR-alpha. Experiments are carried out in a medium to
which RA has been added to various concentrations. In the
Figure, the results are expressed in terms of fold-induction
observed with cells exposed to RA, relative to control cells
(exposed only to RA-free ethanol).
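Fold-induction, as used for the dose-response figures, is the ratio of CAT activity in retinoid-treated cells to CAT activity in vehicle-only control cells. A minimal sketch of that ratio follows; the function names and all numerical values are hypothetical and serve only to show the arithmetic.

```python
def fold_induction(treated, vehicle):
    """Ratio of CAT activity with retinoid to CAT activity with ethanol alone."""
    if vehicle <= 0:
        raise ValueError("vehicle-control activity must be positive")
    return treated / vehicle


def dose_response(activity_by_conc, vehicle):
    """Fold-induction at each molar concentration, lowest concentration first."""
    return {
        conc: fold_induction(activity, vehicle)
        for conc, activity in sorted(activity_by_conc.items())
    }


# Hypothetical CAT readings (arbitrary units) at three RA concentrations (M).
activity = {1e-9: 1.0, 1e-7: 4.0, 1e-5: 12.0}
curve = dose_response(activity, vehicle=0.5)
print(curve)  # {1e-09: 2.0, 1e-07: 8.0, 1e-05: 24.0}
```

Because each point is divided by the same ethanol-only control, curves for different receptors can be compared on a common scale even when their basal activities differ.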

In Figure 8, results are presented for experiments
carried out as described above, using various
concentrations of retinoic acid with expression vectors
encoding mRXR-alpha, mRXR-beta and mRXR-gamma.

In Figure 9, results are presented for experiments
carried out as described above, using various
concentrations of 3,4-didehydroretinoic acid (ddRA) with
expression vectors encoding mRXR-alpha, mRXR-beta and
mRXR-gamma.

Example V

To determine the distribution of hRXR-alpha gene
expression, poly A+ RNAs isolated from a variety of adult
rat tissues were size fractionated, transferred to a nylon
filter, and hybridized with hRXR-alpha cDNA.

Thus, for each tissue of adult male rat that was
analyzed, total RNA was prepared from the tissue (see
Chomczynski and Sacchi, Anal. Biochem. 162, 156 (1987))
and poly A+ selected by oligo(dT)-cellulose chromatography.
Ten micrograms of poly A+ RNA were separated by 1% agarose-
formaldehyde gel electrophoresis, transferred to a Nytran
filter (Schleicher and Schuell) (see McDonnell et al.,
Science 235, 1214 (1987)), and hybridized under stringent
conditions with the hRXR-alpha-encoding EcoRI insert of
lambda-XR3-1. Hybridization was performed at 42°C in a
buffer containing 50% formamide, 5X Denhardt's, 5X SSPE,
0.1% SDS, 100 mg/ml salmon sperm DNA, 200 mg/ml yeast RNA,
and [32P]-labelled probe. The filter was then washed twice
with 2X SSC, 0.1% SDS at 22°C and twice at 50°C.
Autoradiography was for 24 h at -70°C with an intensifying
screen. RNA ladder size markers were from Bethesda Research
Laboratories (Gaithersburg, Maryland, USA).

The distribution of RXR-alpha mRNA in the rat reveals
a pattern of expression distinct from that of the retinoic
acid receptors (Giguere et al., Nature 330, 624 (1987);
Zelent et al., Nature 339, 714 (1989); Benbrook, Nature
333, 669 (1988)). The rat RXR-alpha message appears to be
a single species of about 4.8 kbp (kilobase pairs) which
is expressed in many tissues, but most abundantly in the
liver, muscle, lung, and kidney and somewhat less
abundantly in adrenal, heart, intestine, and spleen.

Example VI 

Molecular cloning analyses of the thyroid hormone and
retinoic acid receptor genes indicate that each of these
receptors belongs to a discrete gene subfamily which
encodes several receptor isoforms. To determine if this
was also true of RXR, a series of Southern blot analyses
was carried out. High stringency hybridization of
restriction endonuclease-digested human DNA with a
labelled DNA fragment derived from lambda-XR3-1 produced
a similar number of bands in every digestion, consistent
with a single genetic locus. When the hybridization
conditions were relaxed, however, many additional bands
were observed in the products of each enzyme digestion.
Careful inspection of this hybridization pattern
demonstrated that it is unrelated to a similar analysis
described for hRAR-alpha (Giguere et al., Nature 330, 624
(1987)). These observations indicate the presence of at
least one other locus in the human genome related to the
hRXR-alpha gene. Further, a genomic DNA zooblot
representing mammalian, avian, yeast, and Drosophila
species was obtained. Thus far, the RXR gene family
appears to be present in all species tested except yeast,
which to date has not been shown to contain any members of
the steroid receptor superfamily.

For the analyses of human DNA, two human placenta
genomic DNA Southern blots were prepared in parallel with
identical DNA samples. The blots were hybridized at high
or low stringency with a ~1200 bp [32P]-labelled fragment
of lambda-XR3-1 which included the coding portions of the
DNA- and ligand-binding domains (Sequence ID No. 1,
nucleotides 459-1631).

For the zooblot, genomic DNAs from human, monkey, rat,
mouse, dog, cow, rabbit, chicken, S. cerevisiae and
Drosophila melanogaster were hybridized at low stringency
with a ~330 bp [32P]-labelled fragment of lambda-XR3-1
which included the DNA-binding domain (Sequence ID No. 1,
nucleotides 459-776). Differently sized bands (in
comparison with HindIII-digested lambda DNA for sizing)
were found for the various species. The blots for all of
the species (including both for D. melanogaster), except
yeast, mouse and rabbit, appeared to have more than one
band.
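The approximate probe sizes quoted for the two Southern analyses can be checked against the stated nucleotide coordinates. The sketch below assumes that the coordinates are inclusive positions in Sequence ID No. 1; the helper name `span_length` is hypothetical.

```python
def span_length(first, last):
    """Number of nucleotides from position `first` through `last`, inclusive."""
    if last < first:
        raise ValueError("last position must not precede first position")
    return last - first + 1


# Probe covering the DNA- and ligand-binding domains (described as ~1200 bp).
print(span_length(459, 1631))  # 1173

# Probe covering the DNA-binding domain only (described as ~330 bp).
print(span_length(459, 776))  # 318
```

Both computed lengths are close to the rounded figures given in the text, which is what would be expected if the sizes were estimated from gel mobility rather than counted from the sequence.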

For the analysis of human DNA, the placental DNA was
restricted with BamHI, BglII, EcoRI, HindIII, PstI and
PvuII, separated in a 0.8% agarose gel (10 µg per lane)
and transferred to nitrocellulose (see McDonnell et al.,
supra) and hybridized as described below.

For the zooblot, EcoRI-digested DNA from the several
species (Clontech, Palo Alto, California, USA), other than
D. melanogaster, was used for Southern blot analysis.
EcoRI- and XhoI-digested D. melanogaster DNA was included
also.

Blots were hybridized at 42°C in the low stringency
buffer described in Example I or at high stringency in the
same buffer modified by addition of formamide to 50%.
Low stringency blots were washed twice at room temperature
and twice at 50°C in 2X SSC, 0.1% SDS. The high
stringency blot was washed twice at room temperature in 2X
SSC, 0.1% SDS and twice at 65°C in 0.5X SSC, 0.1% SDS.

Example VII 

Northern analyses were carried out on the mouse RXR
isoforms alpha, beta and gamma, to determine the tissue
distribution of these receptors in adult tissues and in
developing embryos.

Thus, mRNA (10 µg) was isolated from various adult rat
tissues or from day 10.5 to day 18.5 p.c. whole mouse
embryos. These samples were subjected to Northern
analysis using 32P-labeled cDNA probes derived from regions
specific to mRXRα, mRXRβ, or mRXRγ.

In the adult, the various RXR isoforms are seen to be
expressed in both a specific and overlapping distribution
pattern.

In the embryo, the various isoforms are highly
expressed in what appears to be a specific temporal
pattern.

The invention has been described in detail with
particular reference to preferred embodiments thereof, but
it will be understood that variations and modifications
can be effected within the spirit and scope of the
invention.

Deposit

On January 31, 1990, a sample of replicatable
phagescript SK double-stranded DNA (Stratagene, La Jolla,
California, USA), with the 1860 base-pair, EcoRI-site-
bounded DNA, the sequence of which is illustrated in
Figure 1, inserted at the unique EcoRI site, was
deposited under the terms of the Budapest Treaty on the
International Recognition of the Deposit of Microorganisms
for the Purposes of Patent Procedure at the American Type
Culture Collection, Rockville, Maryland, USA ("ATCC").
The accession number assigned to this deposit is ATCC
40741. The deposited DNA is designated pSK(hRXR-alpha).

Phagescript SK double-stranded DNA is a modified
M13mp18 bacteriophage DNA (double-stranded). Derivatives,
such as pSK(hRXR-alpha), of phagescript SK double-stranded
DNA can be cloned in the same way as M13mp18 and its
derivatives.

Samples of pSK(hRXR-alpha) will be publicly available
from the ATCC without restriction, except as provided in
37 CFR 1.801 et seq., at the latest on the date a
United States Patent first issues on this application or
a continuing application thereof. Otherwise, in
accordance with the Budapest Treaty and the regulations
promulgated thereunder, samples will be available from the
ATCC to all persons legally entitled to receive them under
the law and regulations of any country or international
organization in which an application, claiming priority of
this application, is filed or in which a patent based on
any such application is granted.

SUMMARY OF SEQUENCES

Sequence ID No. 1 is the coding sequence of an EcoRI-
site-bounded DNA segment which encodes the novel receptor
disclosed herein, referred to as human RXR-alpha [hRXRα].

Sequence ID No. 2 is the amino acid sequence of the
novel receptor referred to herein as hRXRα.

Sequence ID No. 3 is the nucleotide (and amino acid)
sequence of the novel receptor disclosed herein, referred
to as mouse RXR-alpha [mRXRα].

Sequence ID No. 4 is the amino acid sequence of the
novel receptor referred to herein as mRXRα.

Sequence ID No. 5 is the nucleotide (and amino acid)
sequence of the novel receptor disclosed herein, referred
to as mouse RXR-gamma [mRXRγ].

Sequence ID No. 6 is the amino acid sequence of the
novel receptor referred to herein as mRXRγ.

Sequence ID No. 7 is the nucleotide sequence of the
receptor disclosed by Hamada et al. in PNAS 86: 8289-8293
(1989). This receptor is similar to the receptor referred
to herein as mRXRβ.
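Each nucleotide listing pairs codons with three-letter amino acid names, so an amino acid sequence can be checked against its nucleotide sequence by translating the codons. The sketch below assumes the standard genetic code; the function name `translate` is hypothetical, and the example input is the first nine codons of the RXR-alpha open reading frame as printed in the listings.

```python
BASES = "TCAG"
# Standard genetic code, one letter per codon, in TCAG x TCAG x TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Stop",
}
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a coding sequence (whitespace ignored) to three-letter names."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of three")
    residues = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if codon not in CODON_TABLE:
            # Characters other than A, C, G, T cannot be translated.
            raise ValueError(f"unreadable codon {codon!r} at offset {pos}")
        residues.append(THREE_LETTER[CODON_TABLE[codon]])
    return residues


print(" ".join(translate("ATG GAC ACC AAA CAT TTC CTG CCG CTC")))
# Met Asp Thr Lys His Phe Leu Pro Leu
```

A codon containing any character other than A, C, G or T raises an error rather than being guessed at, which makes transcription errors in a listing easy to locate.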
SEQ ID NO:1:

GAATTCCCGC GCCCGCCCCC GCCCCCCCGC CGCCCCCTCC CTCCGCCGCC GGCCGGCCAT 60

GAGTTAGTCG CAGAC ATG GAC ACC AAA CAT TTC CTG CCG CTC GAT TTC TCC 111
Met Asp Thr Lys His Phe Leu Pro Leu Asp Phe Ser
1 5 10

ACC CAG GTG AAC TCC TCC CTC ACC TCC CCG ACG GGG CGA GGC TCC ATG 159
Thr Gln Val Asn Ser Ser Leu Thr Ser Pro Thr Gly Arg Gly Ser Met
15 20 25

GCT GCC CCC TCC CTG CAC CCG TCC CTG GGG CCT GGC ATC GGC TCC CCG 207
Ala Ala Pro Ser Leu His Pro Ser Leu Gly Pro Gly Ile Gly Ser Pro
30 35 40



GGA CAG CTG CAT TCT CCC ATC AGC ACC CTG AGC TCC CCC ATC AAC GGC 255 
Gly Gin Leu His Ser Pro lie Ser Thr Leu Ser Ser Pro He Asn Gly 
45 50 55 60 

ATG GGC CCG CCT TTC TCG GTC ATC AGC TCC CCC ATG GGC CCC CAC TCC 303 
Met Gly Pro Pro Phe Ser Vat He Ser Ser Pro Mat Gly Pro Bis Ser 
65 70 75 



25 ATG TCG GTG CCC ACC ACA CCC ACC CTG GGC TTC AGC ACT GGC AGC CCC 351 
Met Ser Val Pro Thr Thr Pro Thr Leu Gly Phe Ser Thr Gly Ser Pro 
80 85 90 

CAG CTC AGC TCA CCT ATG AAC CCC GTC AGC AGC AGC GA6 GAC ATC AAG 399 
3 0 Gin Leu Ser Ser Pro Met Asn Pro Val Ser Ser Ser Glu Asp He Lys 
95 100 105 

CCC CCC CTG GGC CTC AAT GGC GTC CTC AAA GTC CCC GCC CAC CCC TCA 447 
Pro Pro Leu Gly Leu Asn Gly Val Leu Ly* Val Pro Als His Pro Ser 
35 110 115 120 

GGA AAC ATG GCT TCC TTC ACC AAG CAC ATC TGC GCC ATC TGC GGG GAC 495 
Gly Asn Met Ala Ser Phe Thr Lys His He Cys Ala He cys Gly Asp 
125 130 135 140 



CCC TCC TCA GGC AAG CAC TAT GGA GTG TAG AGC TGC GAS GGG TGC AAG 543 
Are Ser Ser Gly Lys His Tyr Gly Val Tyr Ster Cys Glu Gly Cys Lys 
145 WIT 155 



45 GGC TTC TTC AAG CGG ACG GTG CCC AAG GAC CTG ACC TAC ACC TGC CGC 591 
Gly Phe Phe Lys Arg Thr Val Are Lys Asp' Leu Thr Tyr Thr Cys Arg 
160 165 170 

GAC AAC AAG GAC TGC CTG ATT GAC AAG CQG CAG CGG AAC CGG TGC CAG 639 
50 Asp Asn Lys Asp Cys Leu He Asp Lys Arg Gin Arg Asn Arg Cys Gin 
175 180 185 

TAC TGC CGC TAC CAG AAG TGC CTG GCC ATG G6C ATG AAG CGG GAA GCC 687 
Tyr Cys Are Tyr Gin Lys Cys Leu Ale Net Gly Met Lys Arg Glu Ale 
55 190 195 200 

GTG CAG GAG GAG CGG CAG CGT GGC AAG GAC CGG AAC GAG AAT GAG GTG 735 
Val Gin Glu Glu Arg Gin Arg Gly Lys Asp Arg Asn Glu Asn Glu Val 
205 210 215 220 



GAG TCG ACC AGC AGC GCC AAC GAG GAC ATG CCG GTG GAG AGG ATC CTG . 783 
Glu Ser Thr Ser Ser Ale Asn Glu Asp Met Pro Val Glu Arg He Leu 
225 230 235 



65 GAG GCT GAG CTG GCC GTG GAG CCC AAG ACC GAG ACC TAC GTG GAG GCA 831 
Glu Ale Glu Leu Ala Val Glu Pro Lye Thr Glu Thr Tyr Val Glu Ale 

240 245 250 

AAC ATG GGG CTG AAC CCC AGC TCG CCG AAC GAC CCT GTC ACC AAC ATT 879 
70 Asn Net Gly Leu Asn Pro Ser Ser Pro Asn Asp Pro Val Thr Asn II 

255 260 265 
TCC CAA GCA GCC GAC AAA CAG CTT TTC ACC CTG GTC CAC TGG GCC AAG 927
Cys Gln Ala Ala Asp Lys Gln Leu Phe Thr Leu Val Glu Trp Ala Lys

270 275 280 

5 CGG ATC CCA CAC TTC TCA GAG CTG CCC CTG GAC GAC CAG GTC ATC CTG 975 
Arg I la Pro His Phe Ser Glu Leu Pro Leu Asp Asp Gin Val lie Leu 

225 290, 295 300 ... 

CTG CGG GCA GCC TGG AAT GAG CTG CTC ATC GCC TCC TTC TCC CAC CGC 1023 
10 Leu Arg Ala Gly Trp Asn Glu Leu Leu He Ala Ser Pha Ser His Arg 
305 310 315 

TCC ATC GCC CTG AAG GAC GGG ATC CTC CTG GCC ACC GGG CTG CAC GTC 1071 
Ser lie Ala Val Lys Asp Gly He Leu Leu Ala Thr Gly Leu His Val 
15 320 325 330 

CAC CGG AAC AGC GCC CAC ACC GCA GGG GTG CGC GCC ATC TTT GAC AGG 1119 
His Arg Asn Ser Ala His Ser Ala Gly Val Gly Ala He Pha Asp Arg 

335 340 345 






GTG CTG ACG GA6 CTT GTG TCC AAG ATG CGG GAC ATG CAG ATG GAC AAG 1167 
Val Leu Thr Glu Leu Val Ser Lys Net Arg Asp Net Gin Net Asp Lys 
350 355 360 



25 ACG GAG CTG GGC TGC CTG CGC GCC ATC GTC CTC TTT AAC CCT GAC TCC 1215 
Thr Glu Leu Gly Cys Leu Arg Ala He Val Leu Phe Asn Pro Asp Ser 
365 370 375 380 

AAG GGG CTC TCC AAC CCG GCC GAG GTG GAG CCC CTG AGG GAG AAG GTC 1263 
30 Lys Gly Leu Ser Asn Pro Ala Glu Val Glu Ala Leu Arg Glu Lys Val 
385 390 395 

TAT GC6 TCC TTG GAG GCC TAC TGC AAG CAC AAG TAC CCA GAG CAG CCG 1311 
Tyr Ala Ser Leu Glu Ala Tyr Cys Lys His Lys Tyr Pro Glu Gin Pro 
35 400 405 410 

GGA AGG TTC CCT AAG CTC TTG CTC CGC CTG CCG GCT CTG CGC TCC ATC 1359 
Gly Arg Phe Ala Lys Leu Leu Leu Arg Leu Pro Ala Leu Arg Ser He 
415 420 425 



GGG CTC AAA TGC CTG 6AA CAT CTC TTC TTC TTC AAG CTC ATC GGG GAC 1407 
Gly Leu Lys Cys Leu Glu His Leu Phe Phe Phe Lys Leu He Gly Asp 
430 435 440 



45 ACA CCC ATT GAC ACC TTC CTT ATG GAG ATG CTG GAG GCC CCG CAC CAA 1455 
Thr Pro He Asp Thr Phe Leu Net Glu Net Leu Glu Ala Pro His Gin 
445 450 455 460 

ATG ACT TAGGCCTGCG GGCCCATCCT TTGTGCCCAC CCGTTCTGGC CACCCTGCCT 1511 
50 Net Thr 



GGACGCCAGC TGI! CTT CTC AGCCTGAGCC CTGTCCCTGC CCTTCTCTGC CTGGCCTGTT 1571 

55 TGGACTTTG6 GGCACAGCCT GTCACTGCTC TGCCTAAGAfi AT G TGTTGT C ACCCTCCTTA 1631 

TTTCTGTTAC TACTTGTCT6 TGGCCCAGGG CAGTGGCTTT CCTGAGCAGC AGCCTTCGT6 1691 

GCAAGAA C TA GCGTGAGCCC A G C C AGG CC C CT CCCC A CC 6 GGCTCTCAGG A C G C C C TCCC 1751 


ACACCCACGG GGCTTGGGCG ACTACAGGGT CTTCCGCCCC AGCCCTGGAG CTGCAGGAGT 1811 

TGGGAACGGG C CTTTT G m CCGTTGCTGT TTATCGATCC TGGTTTTCAG AATTC 1866 
SEQ ID NO:2:

Met Asp Thr Lys His Phe Leu Pro Leu Asp Phe Ser Thr Gln Val Asn
1 5 10 15

Ser Ser Leu Thr Ser Pro Thr Gly Arg Gly Ser Met Ala Ala Pro Ser
20 25 30

Leu His Pro Ser Leu Gly Pro Gly Ile Gly Ser Pro Gly Gln Leu His
35 40 45



Ser Pro He Ser Thr Leu Ser Ser Pro He Asn Gly Net Gly Pro Pro 

50 55 % 60 

Phe Ser Vel He Ser Ser Pro Net Gly Pro Ms Ser Net Ser Vsl Pro 
65 70 75 80 

Thr Thr Pro Thr Leu Gly Phe Ser Thr Gly Ser Pro Gin Leu Ser Ser 
85 90 95 

Pro Net Asn Pro Vel Ser Ser Ser Glu Asp He Lys Pro Pro Leu Gly 
100 105 110 



Leu Asn Gly Vel Leu Lys Vel Pro Ala His Pro Ser Gly Asn Net Ala 
25 115 120 125 






Ser Phe Thr Lys His He Cys Ala He Cys Gty Asp Arg Ser Ser Gly 
130 135 HO 

Lys His Tyr Gly Val Tyr Ser Cys Glu Gly Cys Lys Gly Phe Phe Lys 
145 150 155 160 

Arg Thr Val Arg Lys Asp Leu Thr Tyr Thr Cys Arg Asp Asn Lys Asp 
165 170 175 

Cys Leu 1 le Asp Lys Arg Gin Arg Asn Arg cys Gin Tyr Cys Arg Tyr 
180 185 190 






Gin Lys Cys Leu Ala Net Gly Net Lys Arg Glu Ala Val Gin Glu Glu 
195 200 205 






Arg Gin Arg Gly Lys Asp Arg Asn Glu Asn Glu Val Glu Ser Thr Ser 
210 215 220 

Ser Ale Asn Glu Asp Net Pro Val Glu Arg He Leu Glu Ala Glu Leu 

225 230 235 240 






Ala Val Glu Pro Lys Thr Glu Thr Tyr Val Glu Ala Asn Net Gly Leu 
245 250 255 

Asn Pro Ser Ser Pro Asn Asp Pro Val Thr Asn He Cys Gin Ala Ala 
260 265 270 



Asp Lys Gin Leu Phe Thr Leu Val Glu Trp Ala Lys Arg He Pro Mis 
55 275 280 285 






Phe Ser Glu Leu Pro Leu Asp Asp Gin Vsl He Leu Leu Arg Ale Gly 

290 295 300 

Trp Asn Glu Leu Leu He Ala Ser Phe Ser His Arg Ser He Ala Val 

305 310 315 320 






Lys Asp Gly He Leu Leu Ala Thr Gly Leu Hfs Val His Arg Asn Ser 

325 330 335 

Ala His Ser Ala Gly Val Gly Ala He Phe Asp Arg Val Leu Thr Glu 

340 345 350 

Leu Vel Ser Lys Net Arg Asp Net Gin Net Asp Lys Thr Glu Leu Gly 

355 360 365 



Cys Leu Arg Ala He Val Leu Phe Asn Pro Asp Ser Lys Gly Leu Ser 

370 375 .380 



WO 91/12258 




Asn Pro Ala 6lu Vat Glu Ala Leu Arg Glu Lye Val Tyr Ala Ser Leu 

385 390 395 400 

Glu Ala Tyr Cy» Lya if a Lya Tyr Pro Glu Gin Pro Gly Arg Phe Ala 
5 405 410 415 

Lya Leu Leu Leu Are Leu Pro Ala Leu Are Ser I la Gly Leu Lya Cye 
420 425 430 

10 Leu Glu lie Leu Phe Phe Phe Lya Leu He Gly Aap Thr Pro lie Asp 
435 440 445 

Thr Phe Leu Net Glu Net Leu Glu Ala Pro 81 a Gin Net Thr 
450 455 460 




SEQ ID NO:3:

GAATTCGCGG CCCCGCCG AC TTTTGCAACA ACTCGCCGCG CCGCGGCCTC CSCCCGCCCC 60 

5 CCCCGCCCCT G CC C CC C C C6 6CTCCCCCCC 6CCC66GCCC C6CCCCCGCC CCCCCS6 GCC 120 

CCCCCCCGC T GCCGCCCTGC T GC T CCSCCS CCSGgTCGGC ATGAGTTA6T CGGAGAC 177 

ATG GAC ACC AAA CAT TTC CTG CCG CTC GAC TTC TCT ACC CAS 6TG AAC 225 
10 Net Asp Thr Lys Hit Phe Lou Pro Leu Asp Phe Ser Thr Gin Val Asn 
15 10 15 

TCT TCG TCC CTC AAC TCT CCA ACS GST CGA GGC TCC ATG GCT GTC CCC 273 
Ser Scr Ser Leu Am Ser Pro Thr Gly Arg Gty Ser Net Ala Vai Pro 
15 20 25 30 

TCG CTG CAC CCC TCC TTG GGT CCG GGA ATC GGC TCT CCA CTG GGC TCG 321 
Ser Leu Nfs Pro Ser Leu Gly Pro Gly lie Gly Ser Pro Leu Sly Ser 
35 40 45 






CCT CCG CAG CTG CAC TCT CCT ATC AGC ACC CTQ AGC TCC CCC ATC AAT 369 
Pro Gly Gin Leu His Ser Pro lie Ser Thr Leu Ser Ser Pro tie Asn 
50 55 60 



25 GGC ATG GGT CCG CCC TTC TCT GTC ATC AGC TCC CCC ATG GGC CCS CAC 417 
Gly Net Gly Pro Pro Phe Ser Val He Ser Ser Pro Met Gly Pro His 
65 70 75 80 

TCC ATG TCG GTA CCC ACC ACA CCC ACA TTG GGC TTC GGG ACT GGT AGC 465 
30 Ser Net Ser Val Pro Thr Thr Pro Thr Leu Gly Phe 6ly Thr Gly Ser 
85 9CT 95 

CCC CAG CTC AAT TCA CCC ATG AAC CCT GTG AGC AGC ACT GAG GAT ATC 513 
Pro Gin Leu Asn Ser Pro Net Asn Pro Val Ser Ser Thr GUf Asp lie 
35 100 105 110 



AAG CCG CCA CTA GGC CTC AAT GGC GTC CTC AAG GTT CCT GCC CAT CCC 561 
Lys Pro Pro Leu Gly Leu Asn 6ly Val Leu Lys Val Pro Ala His Pro 
115 120 125 

TCA GGA AAT ATG GCC TCC TTC ACC AAG CAC ATC T6T GCT ATC TGX GGG 609 
Ser Gly Asn Net Ala Ser Phe Thr Lys Bis- lie Cys Ala lie Cys Gly 
130 135 140 



45 GAC CGC TCC TCA GGC AAA CAC TAT GGG GTA TAC ACT TCT GAG GGC TGC 657 
Asp Arg Ser Ser Gly Lys His Tyr Gly Val Tyr Ser Cys Gitf Wy cys 
145 150 155 160 

AAG GGC TTC TTC AAG AGG ACA GTA CGC AAA GAC CTG ACC TAC ACC TGC 705 
50 Lys Gly Phe Phe Lys Arg Thr Val Arg Lys Asp Leu Thr Tyr Thr Cys 
165 178 175 

CGA GAC AAC AAG GAC TGC CTG ATC GAC AAG AGA CAG CCG AAC CCG TGT 753 
Arg Asp Asn Lys Asp Cys Leu lie Asp Lys Arg Gin Arg Asn Arg Cys 
55 180 185 190 



CAG TAC TGC CGC TAC CAG AAG TGC CTG GCC ATG GGC ATG AAG CCG GAA 801 
Gin Tyr Cys Arg Tyr Gin Lys Cys Leu Ala Net Gly Net Lys Arg Glu 
195 200 205 

GCT GTG CAG GAG GAG CCG CAG CG6 GGC AAG fiAC .CCG AAT GAG AAC GAG 849 
Ala Val Gin Glu Glu Arg Gin Arg Gly Lys Asp Arg Asn Glu Asn Glu 

210 215 r- 220 

65 GTG GAG TCC ACC AGC ACT GCC AAC GAG GAC ATG CCT GTA GAG AAG ATT 897 
Val Glu Ser Thr Ser Ser Ala Asn Glu Asp Net Pro Val Glu Lys lie 

225 230 ,235 240 

CTG GAA GCC GAG CTT GCT GTC GAG CCC AAG ACT GAG ACA TAC GTG GAG 945 
70 Leu Glu Ala Glu Leu Ala Val Glu Pro Lys Thr Glu Thr Tyr Val Glu 
245 25ft 255 
GCA AAC ATG GGG CTG AAC CCC AGC TCA CCA AAT GAC CCT GTT ACC AAC 993
Ala Asn Met Gly Leu Asn Pro Ser Ser Pro Asn Asp Pro Val Thr Asn

260 265 270 

5 ATC TGT CAA GCA GCA GAC AAG CAG CTC TTC ACT CTT GT6 6AG TGG CCC 1041 
II Cys Gin Alb Alt Asp Lys Gin Leu Phe Thr Leu Val Glu Trp Ala 
275 280 285 

AAG AGG ATC CCA CAC TTT TCT GAG CTG CCC CTA GAC GAC CAG GTC ATC 1089 
10 Lys Arg lie Pro Ufa Phe Ser Glu Leu Pro Leu Asp Asp Gin Val lie 
^ 290 295 300 

CTG CTA CGG CCA GGC TGG AAC GAG CTG CTG ATC GCC TCC TTC TCC CAC 1137 
Leu Leu Arg Ala Gly Trp Asn Glu Leu Leu He Ala Ser Phe Ser lie 
15 305 310 315 320 

CSC TCC ATA GCT GT6 AAA GAT GGG ATT CTC CTG GCC ACC GGG CTG CAC 1185 
Arg Ser He Ala Val Lys Asp Gly lie Leu Leu Ala Thr Gly Leu His 
325 330 335 






GTA CAC CGG AAC AGC GCT CAC ACT GCT GGG GTG GGC GCC ATC TTT GAC 1233 
Vel His Arg Asn Ser Ale His Ser Ale Gly Val Gly Ala He Phe Asp 
340 345 350 



25 AGG GTG CTA ACA GAG CTG GTG TCT AAG ATG CGT GAC ATG CAG ATG GAC 1281 
Arg Val Leu Thr Glu Leu Val Ser Lys Met Arg Asp Net Gin Net Asp 

355 360 365 

AAG ACG GAG CTG GGC TGC CTG CGA GCC ATT GTC CTG TTC AAC CCT GAC 1329 
30 Lys Thr Glu Leu Gly Cys Leu Arg Ala He Val Leu Phe Asn Pro Asp 
370 375 380 

TCT AAG GGG CTC TCA AAC CCT GCT GAG GTG GAG GCG TTG AGG GAG AAG 1377 
Ser Lys Gly Leu Ser Asn Pro Ala Glu Val Glu Ala Leu Arg Glu Lys 
35 385 390 395 400 

GTG TAT GCG TCA CTA GAA tSCG TAC TGC AAA CAC AAG TAC CCT GAG CAG 1425 
Val Tyr Ala Ser Leu Glu Ala Tyr Cys Lys His Lys Tyr Pro Glu Gin 
405 410 415 



CCC GGC AGG TTT GCC AAG CTG CTG CTC CGC CTG CCT GCA CTG CGT TCC 1473 
Pro Gly Arg Phe Ala Lys Leu Leu Leu Arg Leu Pro Ala Leu Arg Ser 
420 425 430 



45 ATC GGG CTC AAG TGC CTG GAG CAC CTG TTC TTC TTC AAG CTC ATC GGG 1521 
He Gly Leu Lys Cys Leu Glu His Leu Phe Phe Phe Lys Leu He Gly 
435 440 445 

GAC ACG CCC ATC GAC ATX TTC CTC ATG GAG ATG CT6 GAG GCA CCA CAT 1569 
50 Asp Thr Pro He Asp Thr Phe Leu Net Glu Net Leu Glu Ala Pro His 
450 455 460 

CAA GCC ACC TAGGCCCCCG CCGCCGTGTG CCGGTCCCCT GCCCTCCCTG 1618 
Gin Ala Thr 
55 465 



GACACAGCTG CTCAGCTCCA GCCCTGCCCC TGCCCTTTCT GATGCCCCGT GTGGATCTTT 1678

GGGGTGCAGT GTCCTTATGG            TGCATCACCA TCCTCCCCAT CTTTACTCAT 1738

GCTTGCCTTT            ATAGCAGAGC TGGTGTGACA CCTGGCCAGC TCCTGCCCTA 1798

CATCAGGCTC TAAGGCTATG CTGCTCTCAC CCCGAGGGTC GTGGGGTTCG TCATGCGGCC 1858

TTCAGCACCT GGAGCTGCAA GAGCTGCGAA AAGGGCTTGT TCTGGTTGCT GGTTGCTGGT 1918

CCCTGCTTCT CGACATCCCA CATGGCACCT CTGTTTGGAG TGCCCCATCT TGGCCTGTTC 1978

AGAGTCCTGG TACCCAGTTA GGGTGGGAAT CCACCTGGGA TCAAGAAGGA GCAGGTGGGG 2038

CAGGCCGTAT CCTCCTGGGT CATAGCTAAC CTATAAAGGC GCCGCGAATT CCTCGAG 2095





SEQ ID NO:4:

Met Asp Thr Lys His Phe Leu Pro Leu Asp Phe Ser Thr Gln Val Asn
1 5 10 15

Ser Ser Ser Leu Asn Ser Pro Thr Gly Arg Gly Ser Net Ala Val Pro 
20 25 30 

Ser Leu His Pro Ser Leu Gly Pro Gly lie Gly Ser Pro Leu 6ly Ser 
10 35 40 45 

Pro Gly Gin Leu His Ser Pro He Ser Thr Leu Ser Ser Pro He Asn 
50 55 60 

15 Gly Met Gly Pre Pro Phe Ser Vel lie Ser Ser Pro Net Gly Pro His 
65 70 75 80 






Ser Net Ser Vel Pro Thr Thr Pro Thr Leu Gly Phe Gly Thr Gly Ser 
85 90 95 

Pro Gin Leu Asn Ser Pro Net Asn Pro Vsl Ser Ser Thr 6lu Asp He 
100 105 110 



Lys Pro Pro Leu Gly Leu Asn Gly Val Leu Lys Val Pro Ala His Pro 
25 115 120 125 

Ser 6ly Asn Net Ala Ser Phe Thr Lys His He Cys Ala He Cys Gly 
130 135 HO 

30 Asp Arg Ser Ser Gly Lys His Tyr Gly Val Tyr Ser Cys Glu Gly Cys 
145 150 155 160 



Lys Gly Phe Phe Lys Arg Thr Val Arg Lys Asp Leu Thr Tyr Thr Cys 
165 170 175 

Arg Asp Asn Lys Asp Cys Leu I le Asp Lys Arg Gin Arg Asn Arg Cys 
1«0 185 190 

Gin Tyr Cys Arg Tyr Gin Lys Cys Leu Ala Net Gly Net Lys Arg Glu 
4 0 195 200 205 

Ala Val Gin Glu Glu Arg 6ln Arg Gly Lys Asp Arg Asn Glu Asn Glu 
210 215 220 

45 Val Glu Ser Thr Ser Ser Ala Asn Glu Asp Net Pro Val Glu Lys He 
225 230 235 240 



Leu Glu Ala Glu Leu Ala Val Glu Pro Lys Thr Glu Thr Tyr Val Glu 

245 250 255 

Ala Asn Net Gly Leu Asn Pro Ser Ser Pro Asn Asp Pro Val Thr Asn 
260 265 270 



He Cys Gin Ala Ala Asp Lys 6ln Leu Phe Thr Leu Val Glu Trp Ala 
55 275 280 285 

Lys Arg He Pro His Phe Ser Glu Leu Pro Uu Asp Asp Gin Val He 

290 295 300 

60 Leu Leu Arg Ala 6ly Trp Asn Glu Leu Leu lie Ala Ser Phe Ser His 
305 310 315 320 



Arg Ser He Ala Val Lys Asp Gly He Leu Leu Ala Thr Gly Leu If a 

325 330 335 

Val Its Arg Asn Ser Ala His Ser Ala Gly Val Gly Ala Ho Phe Asp 

340 345 350 



Arg Val Leu Thr Glu Leu Val Ser Lys Net Arg Asp Net Gin Net Asp 

70 355 360 365 

Lys Thr Glu Leu Gly Cys Leu Arg Ala II Val Leu Phe Asn Pro Asp 
370 375 380 
Ser Lys Gly Leu Ser Asn Pro Ala Glu Val Glu Ala Leu Arg Glu Lys
385 390 395 400

Val Tyr Ala Ser Lau Clu Ala Tyr Cy» Lys Hia Lys Tyr Pro Glu Gin 
5 405 410 415 

Pro Gly Arg Phe Ala lys Leu Leu Leu Arg Lag Pro Ala Leu Arg Ser 
420 425 430 

10 lie Gly Lau Lys Cya Lau Glu His Leu Phe Phe Phe Lys Lau lie Gly 
435 440 445 

Asp Thr Pro lie Asp Thr Phe Lau Net Glu Net Lau Glu Ala Pro Ufa 
450 455 460 


Gin Ala Thr 

465 






SEQ ID NO:5:

GAATTCGCGG CCGCCCTGTG CCTGGGAGCC GAGAGAGAGA GAGAGAGAGA GAGAGAGAGA 60 

GAGAGAGAGA GAGAGGCTGT ACTCTTCAGA AGC6CACGA6 AGGAATGAAC TGAfiCACCCA 120 

AC ATG TAT GGA AAT TAT TCC CAC TTC ATG AA6 TTT CCC ACC GGC TTT 167 
Met Tyr Gly Asn Tyr Ser His Phe Hot Lys Phe Pro Thr Gly Phe 
1 5 * 10 15 

GGT GGC TCC CCT GGT CAC ACT GGC TCG AC6 TCC ATG AGC CCT TCA GTA 215 
Gly Gly Ser Pro Gly His Thr Gly Ser Thr Ser Met Ser Pro Ser Vel 
20 25 30 



15 GCC TTC CCC ACS GGG AAG CCA ATG GAC AGC CAC CCC AGC TAC ACA GAC 263 
Ale Leu Pro Thr Gly Lys Pro Met Asp Ser His Pro Ser Tyr Thr Asp 
35 40 <5 

ACC CCA 6TG ACT GCC CCT CGG ACG CTG ACT J6CT CTG GGA ACC CCC CTC 311 
20 Thr Pro Val Ser Ale Pro Arg Thr Leu Ser Ale Vel Gly Thr Pro Leu 
50 55 60 - 

AAT GCT CTT GGC TCT CCG TAT AGA GTC ATC ACT TCT GCC ATG GGT CCA 359 
Asn Ale Leu Gly Ser Pro Tyr Arg Val He Thr Ser Ale Met Gly Pro 
25 65 70 75 

CCC TCA GGA GCA CTG GGA GCT CCT CCA GGA ATC AAC TTG GT6 GCT CCA 407 
Pro Ser Gly Ale Leu Ala Ala Pro Pro Gly He Asn Leu Val Ala Pro 
80 85 90 95 



CCC AGC TCC CAG CTA AAT GTG GTC AAC ACT GTC AGC AGC TCT GAG GAC 455 
Pro Ser Ser Gin Leu Asn Vel Val Asn Ser Vel Ser Ser Ser Glu Asp 
100 105 116 



35 ATC AAG CCC TTA CCA GGT CTG CCT GGG ATT GGA AAT ATG AAC TAC CCA 503 
He Lys Pro Leu Pro Gly Leu Pro Gly He Cly Asn Met Asn Tyr Pro 
115 120 125 

TCC ACC AGC CCT GGG TCT CTG GTG AAA CAC ATC TCT GCC ATC TGT GGG 551 
40 Ser Thr Ser Pro Gly Ser Leu Val Lys His He Cys Ale He Cys Gly 
130 135 140 

GAC AGA TCC TCA GGG AA6 CAC TAC GGT GTG TAC AGC TGT GAA GGT TGC 599 
Asp Arg Ser Ser Gly Lys His Tyr Gly Val Tyr Ser Cys Glu Gly Cys 
45 145 150 155 

AAA GGC TTC TTC AAA AGG ACC ATC AGG AAA GAT CTC ATC TAC ACC TGT 647 
Lys Gly Phe Phe Lys Arg Thr He Arg Lys Asp Leu He Tyr Thr Cys 
160 165 170 175 



CGG GAT AAC AAA GAT TGT CTC ATC GAC AAG CCC CAG CSC AAC CEC TGC 695 
Arg Asp Asn Lys Asp Cys Leu He Asp Lys Arg Gin Arg Asn Arg Cys 
180 185 190 



55 CAG TAC TCT CGC TAC CAG AAG TGC CTG GTC ATG GGC ATG AAG CGG GAA 743 
Gin Tyr Cys Arg Tyr Gin Lys Cys Leu Vel Met Gly Met Lys Arg Glu 
195 200 205 

GCT GTG CAA GAA GAA AGG CAG AGG AGC CGA GAG CGA GCA GAG ACT GAG 791 
60 Ale Vel Gin Glu Glu Arg Gin Arg Ser Ar» Glu Arg Ale Glu Ser Glu 
210 215 220 

GCA GAA TGT GCC AGT AGT AGC CAC GAA GAC ATG CCC GTG GAG AGG ATT 839 
Ale Glu Cys Ale Ser Ser Ser His Glu Asp Net Pro Vel Glu Arg He 
65 225 230 235 

CTA GAA GCC GAA CTT GCT GTG GAA CCA AAG ACA GAA TCC TAC GGT GAC 887 
Leu Glu Ala Glu Leu Ale Vel Glu Pro Lys thr Glu Ser Tyr Gly Asp 
240 245 250 255 



ATG AAC GTG GAG AAC TCA ACA AAT GAC CCT GTT ACC AAC ATA TGC CAT 935 
Met Asn Vsl Glu Asn Ser Thr Asn Asp Pro Val Thr Asn He Cys His 
260 265 270 
GCT GCA GAT AAG CAA CTT TTC ACC CTC GTT CAC TGC CCC AAA CGC ATC 983
Ala Ala Asp Lys Gln Leu Phe Thr Leu Val Glu Trp Ala Lys Arg Ile

275 280 285 

5 CCC CAC TTC TCA GAT CTC ACC TTG GAG GAC CAS 6TC ATT CTA CTC CSS 1031 
Pro Hit Phe Ser Asp Ltu Thr Leu Glu Asp Gin Val lie Leu Leu Arg 

290 295 . 300 

GCA GGG TGG AAT GAA CTG CTC ATT GCC TCC TTC TCC CAC CGC TC6 6TT 1079 
10 Ala Gly Trp Asn Glu Leu Leu lie Ala Ser Phe Ser Mis Arg Ser Val 
305 310 315 

TCC GTC CAS GAT CGC ATC CTG CTG GCC ACG GGC CTC CAC GTS CAC AGG 1127 
Ser Val Gin Asp Gly lie Leu Leu Ala Thr Gly Leu Ufa Val lis Arg 
15 320 .325 330 335 

AGC ACC GCT CAC AGC CG6 GGA GTC GGC TCC ATC TTC GAC AGA GTC CTT 1175 
Ser Ser Ala iis Ser Arg Gly Val Gly Ser lie Phe Asp Arg Val Leu 
340 345 350 






ACA GAG TTG GTG TCC AAG ATG AAA GAC ATG CAG ATG GAT AAG TCA GAG 1223
Thr Glu Leu Val Ser Lys Met Lys Asp Met Gln Met Asp Lys Ser Glu
355 360 365

CTG GGG TGC CTA CCC GCC ATC GTG CTG TTT AAC CCA GAT GCC AAG GGT 1271
Leu Gly Cys Leu Arg Ala Ile Val Leu Phe Asn Pro Asp Ala Lys Gly
370 375 380

TTA TCC AAC CCC TCT GAG GTG GAG ACT CTT CCA GAG AAG GTT TAT GCC 1319
Leu Ser Asn Pro Ser Glu Val Glu Thr Leu Arg Glu Lys Val Tyr Ala
385 390 395

ACC CTG GAG GCC TAT ACC AAG CAG AAG TAT CCC GAA CAG CCA GGC AGG 1367
Thr Leu Glu Ala Tyr Thr Lys Gln Lys Tyr Pro Glu Gln Pro Gly Arg
400 405 410 415

TTT GCC AAG CTT CTG CTG CGT CTC CCT GCT CTG CGC TCC ATC GGC TTG 1415
Phe Ala Lys Leu Leu Leu Arg Leu Pro Ala Leu Arg Ser Ile Gly Leu
420 425 430

AAA TGC CTG GAA CAC CTC TTC TTC TTC AAG CTC ATT GGA GAC ACT CCC 1463
Lys Cys Leu Glu His Leu Phe Phe Phe Lys Leu Ile Gly Asp Thr Pro
435 440 445

ATC GAC AGC TTC CTC ATG GAG ATG TTG GAG ACC CCA CTG CAG ATC ACC 1511
Ile Asp Ser Phe Leu Met Glu Met Leu Glu Thr Pro Leu Gln Ile Thr
450 455 460

TGAACCTCCT CAGCTGCAGC TTCCCCACCC AGGGTGACCC TTGGGCGGGT GTGTGTGTGT 1571
GGCCCTACCC TGCACACTCT CCCCCATCTT CCACTCTGGC CTCCCTTCCT GTCCCCAAAA 1631
TGTGATGCTT GTAATAAGCG CCCGCGAATT C 1662
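
Each codon row in the listing is printed above its three-letter translation, so the two lines can be cross-checked mechanically, which helps to spot transcription errors in either line. The following Python sketch assumes the standard genetic code. The names `translate` and `mismatches` are illustrative and do not come from the specification, and a codon containing anything other than A, C, G or T will raise a `KeyError` rather than be guessed at.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter amino acid codes ("End" marks a stop codon).
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "End",
}


def translate(codon_row):
    """Translate a space-separated row of codons into three-letter codes."""
    return [THREE_LETTER[CODON_TABLE[c]] for c in codon_row.upper().split()]


def mismatches(codon_row, printed_row):
    """Return (index, codon, translated, printed) where the rows disagree."""
    codons = codon_row.upper().split()
    translated = translate(codon_row)
    printed = printed_row.split()
    return [
        (i, codon, aa, shown)
        for i, (codon, aa, shown) in enumerate(zip(codons, translated, printed))
        if aa != shown
    ]


# The codon row ending at nucleotide 647 in the listing above.
row = "AAA GGC TTC TTC AAA AGG ACC ATC AGG AAA GAT CTC ATC TAC ACC TGT"
print(" ".join(translate(row)))
```

Calling `mismatches` with a codon row and the translation printed beneath it returns an empty list when the two agree, and otherwise lists each position where the printed residue differs from the one the codon encodes.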






SEQ ID NO:6:

Met Tyr Gly Asn Tyr Ser His Phe Met Lys Phe Pro Thr Gly Phe Gly
1 5 10 15

Gly Ser Pro Gly His Thr Gly Ser Thr Ser Met Ser Pro Ser Val Ala
20 25 30

Leu Pro Thr Gly Lys Pro Met Asp Ser His Pro Ser Tyr Thr Asp Thr
35 40 45

Pro Val Ser Ala Pro Arg Thr Leu Ser Ala Val Gly Thr Pro Leu Asn
50 55 60

Ala Leu Gly Ser Pro Tyr Arg Val Ile Thr Ser Ala Met Gly Pro Pro
65 70 75 80

Ser Gly Ala Leu Ala Ala Pro Pro Gly Ile Asn Leu Val Ala Pro Pro
85 90 95

Ser Ser Gln Leu Asn Val Val Asn Ser Val Ser Ser Ser Glu Asp Ile
100 105 110

Lys Pro Leu Pro Gly Leu Pro Gly Ile Gly Asn Met Asn Tyr Pro Ser
115 120 125

Thr Ser Pro Gly Ser Leu Val Lys His Ile Cys Ala Ile Cys Gly Asp
130 135 140

Arg Ser Ser Gly Lys His Tyr Gly Val Tyr Ser Cys Glu Gly Cys Lys
145 150 155 160

Gly Phe Phe Lys Arg Thr Ile Arg Lys Asp Leu Ile Tyr Thr Cys Arg
165 170 175

Asp Asn Lys Asp Cys Leu Ile Asp Lys Arg Gln Arg Asn Arg Cys Gln
180 185 190

Tyr Cys Arg Tyr Gln Lys Cys Leu Val Met Gly Met Lys Arg Glu Ala
195 200 205

Val Gln Glu Glu Arg Gln Arg Ser Arg Glu Arg Ala Glu Ser Glu Ala
210 215 220

Glu Cys Ala Ser Ser Ser His Glu Asp Met Pro Val Glu Arg Ile Leu
225 230 235 240






Glu Ala Glu Leu Ala Val Glu Pro Lys Thr Glu Ser Tyr Gly Asp Met
245 250 255

Asn Val Glu Asn Ser Thr Asn Asp Pro Val Thr Asn Ile Cys His Ala
260 265 270

Ala Asp Lys Gln Leu Phe Thr Leu Val Glu Trp Ala Lys Arg Ile Pro
275 280 285

His Phe Ser Asp Leu Thr Leu Glu Asp Gln Val Ile Leu Leu Arg Ala
290 295 300

Gly Trp Asn Glu Leu Leu Ile Ala Ser Phe Ser His Arg Ser Val Ser
305 310 315 320

Val Gln Asp Gly Ile Leu Leu Ala Thr Gly Leu His Val His Arg Ser
325 330 335

Ser Ala His Ser Arg Gly Val Gly Ser Ile Phe Asp Arg Val Leu Thr
340 345 350

Glu Leu Val Ser Lys Met Lys Asp Met Gln Met Asp Lys Ser Glu Leu
355 360 365

Gly Cys Leu Arg Ala Ile Val Leu Phe Asn Pro Asp Ala Lys Gly Leu
370 375 380

Ser Asn Pro Ser Glu Val Glu Thr Leu Arg Glu Lys Val Tyr Ala Thr
385 390 395 400

Leu Glu Ala Tyr Thr Lys Gln Lys Tyr Pro Glu Gln Pro Gly Arg Phe
405 410 415

Ala Lys Leu Leu Leu Arg Leu Pro Ala Leu Arg Ser Ile Gly Leu Lys
420 425 430

Cys Leu Glu His Leu Phe Phe Phe Lys Leu Ile Gly Asp Thr Pro Ile
435 440 445

Asp Ser Phe Leu Met Glu Met Leu Glu Thr Pro Leu Gln Ile Thr
450 455 460

SEQ ID NO:7:

GAATTCCCCC GAAGCCCAGA CAGCTCCTCC CCAAATCCCC TTTCTCAGGG GATCCGTCCG 60
TCTTCTCCTC CTCGCCCACC TCTTACCCCT TCAGCACCTC CACCTCCA ATC CCA CCC 117
Met Pro Pro
1

CCC CCA CTG GGC TCC CCC TTC CCA GTC ATC ACT TCT TCC ATG GGG TCC 165
Pro Pro Leu Gly Ser Pro Phe Pro Val Ile fior Ser Ser Met Gly Ser
5 10 15

CCT GGT CTG CCC CCT CCC GCT CCC CCA GCA TTC TCC GGG CCT GTC AGC 213
Pro Gly Leu Pro Pro Pro Ala Pro Pro Gly Phe Ser Gly Pro Val Ser
20 25 30 35

AGC CCT CAG ATC AAC TCC ACA GTG TCG CTC CCT GGG GCT GGG TCT GGC 261
Ser Pro Gln Ile Asn Ser Thr Val Ser Leu Pro Gly Gly Gly Ser Gly
40 45 50






CCC CCT GAA GAT GTG AAC CCA CCG CTC TTA GGG GTC CGG GGC CTG CAC 309
Pro Pro Glu Asp Val Lys Pro Pro Val Leu Gly Val Arg Gly Leu His
55 60 65

TGT CCA CCC CCT CCA GCT GGT CCT GGG GCT GGC AAA CGG CTC TCT GCA 357
Cys Pro Pro Pro Pro Gly Gly Pro Gly Ala Gly Lys Arg Leu Cys Ala
70 75 80

ATC TCC GGG GAC CGA AGC TCA GGC AAG CAC TAT GGG CTT TAC AGC TCC 405
Ile Cys Gly Asp Arg Ser Ser Gly Lys His Tyr Gly Val Tyr Ser Cys
85 90 95

GAG GGC TGC AAG GGT TTC TTC AAG CGC ACC ATT CGG AAG GAC CTC ACC 453
Glu Gly Cys Lys Gly Phe Phe Lys Arg Thr Ile Arg Lys Asp Leu Thr
100 105 110 115

TAC TCC TCT CGT GAT AAC AAA GAC TCT ACA CTC GAC AAG CGC CAC CGG 501
Tyr Ser Cys Arg Asp Asn Lys Asp Cys Thr Val Asp Lys Arg Gln Arg
120 125 130

AAT CGC TGT CAG TAC TCT CGC TAT CAG AAG TGC CTG GCC ACT GGC ATG 549
Asn Arg Cys Gln Tyr Cys Arg Tyr Gln Lys Cys Leu Ala Thr Gly Met
135 140 145

AAA AGG GAG GCG GTT CAG GAG GAG CGT CAA CGG GGG AAG GAC AAA GAC 597
Lys Arg Glu Ala Val Gln Glu Glu Arg Gln Arg Gly Lys Asp Lys Asp
150 155 160

GGG GAT GGA GAT GGG GCT GGG GGA GCC CCT GAG GAC ATG CCT CTC GAC 645
Gly Asp Gly Asp Gly Ala Gly Gly Ala Pro Glu Glu Met Pro Val Asp
165 170 175

AGG ATC CTC GAC GCA CAG CTT CCT CTG GAG CAG AAG AGT GAC CAA GGC 693
Arg Ile Leu Glu Ala Glu Leu Ala Val Glu Gln Lys Ser Asp Gln Gly
180 185 190 195

CTT GAG GGT CCT GGG GCC ACC GGC GGT GCT GGC AGC AGC CCA AAT CAC 741
Val Glu Gly Pro Gly Ala Thr Gly Gly Gly Gly Ser Ser Pro Asn Asp
200 205 210

CCA GTC ACT AAC ATC TGC CAG GCA GCT GAC AAA CAG CTG TTC ACA CTC 789
Pro Val Thr Asn Ile Cys Gln Ala Ala Asp Lys Gln Leu Phe Thr Leu
215 220 225

CTT GAG TGG GCA AAG AGG ATC CCC CAC TTC TCC TCC CTA CCT CTC GAC 837
Val Glu Trp Ala Lys Arg Ile Pro His Phe Ser Ser Leu Pro Leu Asp
230 235 240

CAT CAC CTC ATA CTG CTG CGG GCA GGC TGC AAC CAG CTC CTC ATT GCG 885
Asp Gln Val Ile Leu Leu Arg Ala Gly Trp Asn Glu Leu Leu Ile Ala
245 250 255

TCC TTC TCC CAT CGG TCC ATT GAT GTC CCA GAT GGC ATC CTC CTG GCC 933
Ser Phe Ser His Arg Ser Ile Asp Val Arg Asp Gly Ile Leu Leu Ala
260 265 270 275

ACG GCT CTT CAT CTG CAC AGA AAC TCA GCC CAT TCC CCA GGC GTG GGA 981
Thr Gly Leu His Val His Arg Asn Ser Ala His Ser Ala Gly Val Gly
280 285 290

GCC ATC TTT GAT CGG CTG CTG ACA GAG CTA GTG TCC AAA ATG CGT GAC 1029
Ala Ile Phe Asp Arg Val Leu Thr Glu Leu Val Ser Lys Met Arg Asp
295 300 305

ATG AGG ATG GAC AAG ACA GAG CTT GGC TCC CTG CGG GCA ATC ATA CTG 1077
Met Arg Met Asp Lys Thr Glu Leu Gly Cys Leu Arg Ala Ile Ile Leu
310 315 320

TTT AAT CCA GAC GCC AAG GGC CTC TCC AAC CCT GGA GAG CTG GAG ATC 1125
Phe Asn Pro Asp Ala Lys Gly Leu Ser Asn Pro Gly Glu Val Glu Ile
325 330 335

CTT CGG GAG AAG CTG TAC GCC TCA CTG GAG ACC TAT TGC AAG CAG AAG 1173
Leu Arg Glu Lys Val Tyr Ala Ser Leu Glu Thr Tyr Cys Lys Gln Lys
340 345 350 355

TAC CCT GAG CAG CAG GGC CGG TTT GCC AAG CTG CTG TTA CGT CTT CCT 1221
Tyr Pro Glu Gln Gln Gly Arg Phe Ala Lys Leu Leu Leu Arg Leu Pro
360 365 370

GCC CTC CGC TCC ATC GGC CTC AAG TGT CTG GAG CAC CTG TTC TTC TTC 1269
Ala Leu Arg Ser Ile Gly Leu Lys Cys Leu Glu His Leu Phe Phe Phe
375 380 385

AAG CTC ATT GGC GAC ACC CCC ATT GAC ACC TTC CTC ATG GAG ATG CTT 1317
Lys Leu Ile Gly Asp Thr Pro Ile Asp Thr Phe Leu Met Glu Met Leu
390 395 400

GAG GCT CCC CAC CAG CTA GCC TGAGCCCAGA TGCACACCGA GTGTCACTGA 1368
Glu Ala Pro His Gln Leu Ala
405 410

GGAGGACTTG AGCCTGGGCA CGGGGCAGAG CCATGGGACA GGTGCAGAGC AGGACGGGAC 1428
TTGCCCAGCC TGCCAGGGAT CTGGCAACAC TTAGCACGGT TCGCTTGGTC TCCAAGTCGA 1488
AGGGGACCCT AGATCCCTGT GAGGACTTTA TGTCTACCTT CAGTGCCCTT GAGTCTCTGA 1548
ATTTGTCGCC GTCTCCCATG GTGCAGGTGA TTCTTCATCC TGGCTCCCCA GCACAAAGCA 1608
CTGCCCTGCT TCCTTCTCAT TTCGCCTCAC TCCCTTCTGA AGAGTGGAAC AGAGCTCCCC 1668
CACAAAGCGC TCTTGTGCGG CAGCCCCCCC AAGCTGATGA TCATGGGAGC AGGGCTCTGA 1728
CAGCCTTTAT CCTCTCAGAC TTGACAGATG GGGGCAGAGG AGGGACCTGC CTCTGTCTCC 1788
TCTCAGCCCC ATTTCCACAG TCCCTCCTGC AGTCACACTG AAGAATAAAG GCCTACTGAA 1848
BGGGCTGCTG GAGGTGGAGG AACCCATTGC TCTTTTAATT TCCTGTGAGG AGAGACTGGG 1908
AGTTAGACTC AAAGAAGTAC TGTACATCCC CAGGTTGACT TAAATCTCAG GGCTGGAGAT 1968
GGCATGTGGG CAAGGAGGCC CCTCAGGTGG GCTGTCCCAA AGCTCCCTGG GCTCTGCCTC 2028
GGGT0CCCCT ACAGCTCTTC CCTAGTCTTA AGCACAGCTA GGCTGGGAGC AAGTGGGGAC 2088
ATTGATGGGG CTGGCCAGCC TGCAGAGTTG GGTGCTGGGC TGCATGGTTT TTGCCCTGGA 2148
CCTCTTTTGG GGGnCCCTC CCATCTTTCA CTTGCACATA AAGTTGCTTT CCAGTTAAAA 2208
AAAAAAAAAA A 2219

CLAIMS

That which is claimed is:

1. A substantially pure DNA sequence which encodes
a polypeptide, wherein said polypeptide is characterized
by:

(1) being responsive to the presence of retinoid(s)
to regulate the transcription of associated
gene(s);

(2) having a DNA binding domain of about 66 amino
acids with 9 Cys residues, wherein said DNA
binding domain has:

(a) less than about 65 % amino acid identity
with the DNA binding domain of hRAR-alpha,

(b) less than about 55 % amino acid identity
with the DNA binding domain of hTR-beta,
and

(c) less than about 55 % amino acid identity
with the DNA binding domain of hGR; and

(3) not including the sequence set forth in
Sequence ID No. 7.
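
The identity limits in Claim 1 compare DNA binding domains residue by residue. A minimal Python sketch of such a comparison follows, under stated assumptions: the two sequences are already aligned to equal length with `-` marking gaps, identity is counted over the columns in which neither sequence has a gap, and "about" is treated as a strict numeric cutoff. The claim does not prescribe an alignment method or a counting convention, and the function names and example fragments here are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of gap-free alignment columns holding identical residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    # Keep only columns where both sequences have a residue (assumption).
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x != "-" and y != "-"
    ]
    if not columns:
        raise ValueError("no gap-free columns to compare")
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def within_claimed_limits(id_vs_rar, id_vs_tr, id_vs_gr):
    """Apply cutoffs (a) to (c) as strict inequalities (an assumption)."""
    return id_vs_rar < 65.0 and id_vs_tr < 55.0 and id_vs_gr < 55.0


# Hypothetical one-letter fragments, not taken from any receptor alignment.
print(percent_identity("CAICGDRS", "CAVCGDKS"))
print(within_claimed_limits(61.0, 52.0, 48.0))
```

Other conventions, such as dividing by the length of the shorter sequence or by the full alignment length, give different percentages for the same pair, so the convention should be stated alongside any figure that is reported.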

2. A DNA sequence according to Claim 1 wherein said
polypeptide is encoded by a continuous sequence which
encodes substantially the same sequence as that of:

amino acids 1 - 462 shown in Sequence ID No. 2
[hRXR-α],

amino acids 1 - 467 shown in Sequence ID No. 4
[mRXR-α], or

amino acids 1 - 463 shown in Sequence ID No. 6
[mRXR-γ].






3. A DNA sequence according to Claim 1 wherein said
polypeptide is encoded by a continuous sequence which
encodes substantially the same sequence as that of:

amino acids 135 - 200 shown in Sequence ID No. 2
[hRXR-α],

amino acids 140 - 205 shown in Sequence ID No. 4
[mRXR-α], or

amino acids 139 - 204 shown in Sequence ID No. 6
[mRXR-γ].

4. A DNA sequence according to Claim 1 which
comprises a segment having a continuous nucleotide
sequence which is substantially the same as:

nucleotides 76 - 1464 shown in Sequence ID No. 1
[hRXR-α],

nucleotides 181 - 1581 shown in Sequence ID No. 2
[mRXR-α], or

nucleotides 123 - 1514 shown in Sequence ID No. 3
[mRXR-γ].

5. A DNA sequence according to Claim 4 which is
pSK(hRXR-alpha), pSK(mRXR-alpha), or pSK(mRXR-gamma).

6. A substantially pure DNA construct comprising:

(i) the DNA sequence of Claim 1 operatively linked
to

(ii) regulatory element(s) operative for
transcription of said DNA sequence and
expression of said polypeptide in an animal
cell in culture.

7. A DNA construct according to Claim 6 which is
selected from A5C-hRXR-alpha, A5C-mRXR-alpha,
A5C-mRXR-gamma, RS-hRXR-alpha, RS-mRXR-alpha, or
RS-mRXR-gamma.



8. An animal cell in culture which is transformed
with a DNA construct according to Claim 6.

9. A cell according to Claim 8 wherein said cell is
an insect cell or a mammalian cell.

10. A cell according to Claim 9 wherein the DNA
construct is selected from A5C-hRXR-alpha, A5C-mRXR-alpha,
A5C-mRXR-gamma, RS-hRXR-alpha, RS-mRXR-alpha, or
RS-mRXR-gamma.

11. A cell according to Claim 8, wherein said cell
is further transformed with a reporter vector which
comprises:

(a) a promoter that is operable in said cell,

(b) a hormone response element, and

(c) a DNA segment encoding a reporter protein,

wherein said reporter protein-encoding DNA
segment is operatively linked to said promoter
for transcription of said DNA segment, and

wherein said hormone response element is
operatively linked to said promoter for
activation thereof.

12. A cell according to Claim 11 wherein:

the promoter is the 5'-LTR promoter of a mouse
mammary tumor virus,
the hormone response element is selected from
TREp or beta-RARE, and
the reporter protein is selected from
chloramphenicol acetyl transferase, luciferase, or
beta-galactosidase.



13. A cell according to Claim 12 wherein the
reporter vector is selected from delta-MTV-TREp-CAT,
delta-TK-TREp-CAT, delta-SV-TREp-CAT, delta-MTV-TREp-LUC,
delta-TK-TREp-LUC, or delta-SV-TREp-LUC.



WO 91/12258 



14. A cell according to Claim 12 wherein the
reporter vector is selected from ADH-TREp-CAT, ADH-TREp-LUC,
TK-TREp-CAT, or TK-TREp-LUC.

15. A cell according to Claim 14 which is a
Drosophila melanogaster Schneider line 2 cell.

16. A method of testing a compound for its ability
to regulate transcription-activating effects of a
receptor polypeptide, said method comprising assaying for
the presence or absence of reporter protein upon
contacting of cells containing a receptor polypeptide and
reporter vector with said compound;

wherein said receptor polypeptide is characterized
by:

(1) being responsive to the presence of retinoid(s)
to regulate the transcription of associated
gene(s); and

(2) having a DNA binding domain of about 66 amino
acids with 9 Cys residues, wherein said DNA
binding domain has:

(a) less than about 65 % amino acid identity
with the DNA binding domain of hRAR-alpha,

(b) less than about 55 % amino acid identity
with the DNA binding domain of hTR-beta,
and

(c) less than about 55 % amino acid identity
with the DNA binding domain of hGR, and

wherein said reporter vector comprises:

(a) a promoter that is operable in said cell,

(b) a hormone response element, and

(c) a DNA segment encoding a reporter protein,

wherein said reporter protein-encoding DNA
segment is operatively linked to said promoter
for transcription of said DNA segment, and

wherein said hormone response element is
operatively linked to said promoter for
activation thereof.

17. A method according to Claim 16 wherein said
contacting is carried out in the further presence of at
least one retinoid species.

18. A method according to Claim 16 wherein the
cells employed are CV-1 cells co-transformed with a
vector capable of expressing said receptor polypeptide,
wherein said vector is selected from RS-hRXR-alpha,
RS-mRXR-alpha, or RS-mRXR-gamma, and a reporter vector
selected from delta-MTV-TREp-CAT, delta-TK-TREp-CAT,
delta-SV-TREp-CAT, delta-MTV-TREp-LUC, delta-TK-TREp-LUC,
or delta-SV-TREp-LUC.



19. A method according to Claim 16 wherein the
cells employed are Drosophila melanogaster Schneider line
2 cells co-transformed with a vector capable of
expressing said receptor polypeptide, wherein said vector
is selected from A5C-hRXR-alpha, A5C-mRXR-alpha, or
A5C-mRXR-gamma, and a reporter vector selected from
ADH-TREp-CAT, ADH-TREp-LUC, TK-TREp-CAT, or TK-TREp-LUC.

20. A labeled single-stranded nucleic acid
sequence, comprising at least 20 contiguous bases in
length having substantially the same sequence as any 20
or more contiguous bases selected from:

(i) bases 2 - 1861, inclusive, of the DNA
illustrated in Sequence ID No. 1 [hRXR-α], or

(ii) bases 20 - 2095, inclusive, of the DNA
illustrated in Sequence ID No. 2 [mRXR-α], or

(iii) bases 15 - 1653, inclusive, of the DNA
illustrated in Sequence ID No. 3 [mRXR-γ], or

(iv) the complement of any one of the sequences
according to (i), (ii), or (iii).
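
Claim 20 turns on whether a probe shares a run of at least 20 contiguous bases with one of the listed sequences or its complement. The Python sketch below checks that condition for exact matches only. This is a simplification, since the claim says "substantially the same", and it reads "complement" as the reverse complement of the listed strand. The function names and the short example sequence are illustrative assumptions.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def shares_run(probe, target, k=20):
    """True if probe and target (either strand) share k contiguous bases."""
    probe, target = probe.upper(), target.upper()
    if len(probe) < k:
        return False
    # Every k-base window of the probe.
    windows = {probe[i:i + k] for i in range(len(probe) - k + 1)}
    for strand in (target, reverse_complement(target)):
        for i in range(len(strand) - k + 1):
            if strand[i:i + k] in windows:
                return True
    return False


# A short illustrative target and a 25-base probe cut from inside it.
target = "GAATTCCCCCGAAGCCCAGACAGCTCCTCCCCAAATCCCC"
probe = target[5:30]
print(shares_run(probe, target))
print(shares_run(reverse_complement(probe), target))
```

A tolerance for mismatches, which a hybridization-based reading of "substantially the same" would imply, is outside this sketch and would need a scoring rule that the claim itself does not supply.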

21. A nucleic acid according to Claim 20 which is
labelled with ³²P.

22. A method of making a receptor polypeptide,
wherein said polypeptide is characterized by:

(1) being responsive to the presence of retinoid(s)
to regulate the transcription of associated
gene(s); and

(2) having a DNA binding domain of about 66 amino
acids with 9 Cys residues, wherein said DNA
binding domain has:

(a) less than about 65 % amino acid identity
with the DNA binding domain of hRAR-alpha,

(b) less than about 55 % amino acid identity
with the DNA binding domain of hTR-beta,
and

(c) less than about 55 % amino acid identity
with the DNA binding domain of hGR;

said method comprising culturing cells containing an
expression vector operable in said cells to express a DNA
sequence encoding said polypeptide.

23. A method according to Claim 22 wherein said
receptor polypeptide has substantially the same sequence
as that of:

amino acids 1 - 462 shown in Sequence ID No. 2
[hRXR-α],

amino acids 1 - 467 shown in Sequence ID No. 4
[mRXR-α], or

amino acids 1 - 463 shown in Sequence ID No. 6
[mRXR-γ].

24. A method according to Claim 22 wherein said
receptor polypeptide comprises a DNA binding domain with
substantially the same sequence as that of:

amino acids 135 - 200 shown in Sequence ID No. 2
[hRXR-α],

amino acids 140 - 205 shown in Sequence ID No. 4
[mRXR-α], or

amino acids 139 - 204 shown in Sequence ID No. 6
[mRXR-γ].

25. A method according to Claim 22 wherein said DNA
sequence comprises a segment with substantially the same
nucleotide sequence as that of:

nucleotides 76 - 1464 shown in Sequence ID No. 1
[hRXR-α],

nucleotides 181 - 1581 shown in Sequence ID No. 2
[mRXR-α], or

nucleotides 123 - 1514 shown in Sequence ID No. 3
[mRXR-γ].



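[Drawing sheets 1/9 to 9/9: drawings not reproduced. Recoverable labels follow.]

Schematic comparison of receptors (residue numbers as printed; percent identity labels largely illegible):
mRXRα: 140, 205, 230, 467
hRARα: 88, 153, 198, 462
hER: 185, 250, 305, 595
hTRβ: 102, 169, 232, 456
hGR: 421, 486, 528, 777

Schematic comparison of RXR subtypes (residue numbers as printed; percent identity labels largely illegible):
mRXRα: 140, 205, 230, 467
mRXRβ: 82, 147, 171, 410
mRXRγ: 139, 204, 229
(row label not legible): 135, 200, 225, 462

Dose-response plots: x-axis labels "M, Retinoic Acid" and "M, ddRA"; y-axis label "% Maximal Conversion"; compound labels RP, ROH, RAC, RAL, RA.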



INTERNATIONAL SEARCH REPORT

I. CLASSIFICATION OF SUBJECT MATTER

According to International Patent Classification (IPC) or to both National Classification and IPC

IPC(5): C07K 15/12; C07K 3/00; C12Q 1/66; C12S 15/OC
U.S. Cl.: 536/27; 530/350; 435/6; 935/77, 75

II. FIELDS SEARCHED

Classification Symbols: 536/27; 530/350; 435/6; 935/77, 75

Documentation Searched other than Minimum Documentation
to the Extent that such Documents are Included in the Fields Searched:

GenBank, EMBL

III. DOCUMENTS CONSIDERED TO BE RELEVANT

Category; Citation of Document, with indication, where appropriate, of the relevant passages; Relevant to Claim No.

X  Nature, Vol. 337, issued 09 February 1989, Giguere
et al., "Spatial and Temporal Expression of the
Retinoic Acid Receptor in the Regenerating Amphibian
Limb", pages 566-569, see especially Figure 1.

X  Nature, Vol. 330, issued 17 December 1987, Giguere
et al., "Identification of a Receptor for the
Morphogen Retinoic Acid", pages 624-629, see
especially Figure 1.

X  Nature, Vol. 331, issued 07 January 1988, Giguere
et al., "Identification of a New Class of Steroid
Hormone Receptors", pages 91-94, see especially
Figure 1.



1-25 



1-25 






IV. CERTIFICATION

Date of the Actual Completion of the International Search: U March 1991

Date of Mailing of this International Search Report: 2 MAY 1991

International Searching Authority: ISA/US

Signature of Authorized Officer: ndy B. Fleisher



